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ABSTRACT. Let (X7,o7) be a shift of finite type, and G = aut(o7) denote 
the group of homeomorphisms of Xr commuting with or. We investigate 
the algebraic properties of the countable group G and the dynamics of its 
action on Xr and associated spaces. Using “marker” constructions, we show G 
contains many groups, such as the free group on two generators. However, G is 
residually finite, so does not contain divisible groups or the infinite symmetric 
group. The doubly exponential growth rate of the number of automorphisms 
depending on n coordinates leads to a new and nontrivial topological invariant 
of or whose exact value is not known. We prove that, modulo a few points 
of low period, G acts transitively on the set of points with least o7r-period n. 
Using p-adic analysis, we generalize to most finite type shifts a result of Boyle 
and Krieger that the gyration function of a full shift has infinite order. The 
action of G on the dimension group of ør is investigated. We show there are no 
proper infinite compact G-invariant sets. We give a complete characterization 
of the G-orbit closure of a continuous probability measure, and deduce that the 
only continuous G-invariant measure is that of maximal entropy. Examples, 
questions, and problems complement our analysis, and we conclude with a 
brief survey of some remaining open problems. 
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1. Introduction. Let T be a square nonnegative integral matrix. Following 
Williams [Wi], we associate to T a homeomorphism ør of a totally disconnected 
compact space Xr as follows. If T is r x r, form the directed graph with r states 
or nodes, and with T;; symbols or edges from state 2 to state 7. Let L be the set of 
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symbols of this graph. Then Xr C L% consists of those z = (...,2~1,%0,21,---) 
with the terminal state of z;_, matching the initial state of z; for all 7 € Z. Points 
in Xr may be thought of as infinite trips on the graph. Clearly Xr is compact in 
the topology induced from the product topology on £7, and the shift or: Xr > Xr 
defined by (orz); = 2:41 is a homeomorphism. This dynamical system (X7,o7r) 
is called a shift of finite type or topological Markov shift. Such systems are intrinsi- 
cally characterized as expansive homeomorphisms of totally disconnected compact 
spaces with canonical coordinates [Bo]. They play a prominent role not only in 
topological dynamics [DGS] and coding theory [ACH], but are also crucial to the 
analysis of hyperbolic diffeomorphisms [Sm]. We shall assume throughout that or 
is mixing, or, equivalently, that some power of T is strictly positive. To avoid trivial 
exceptions, we also require T # [1]. 


Let G = aut(or) denote the group of homeomorphisms of Xr commuting with 
or. If p € G, then the fundamental observation of Curtis, Lyndon, and Hedlund 
[H, Theorem 3.4] shows there is an n and a finite block map f: L2"*! — £ so that 
(pz); = f(ti-n,.-.-,Zi¢n). It follows that G is countable, and is discrete in the 
compact-open mapping topology. Despite the finite character of such mappings, 
very little is known about the algebraic structure of G. Hedlund [H] showed that 
for the full k-shift, aut(oj,)) contains two involutions whose product has infinite 
order, and also a copy of every finite group. Ryan [Ry2] showed that the center 
of aut(or) contains only the group © of powers of ør. However, it is still an open 
problem whether the automorphism group of the 2-shift is generated by the shift 
and involutions in the group. Another example of our ignorance is the inability to 
settle the question whether the automorphism groups of the 2-shift and 3-shift are 
isomorphic. 


Recently two new approaches to the structure of G = aut(o7) have been made. 
Boyle and Krieger [BK] used the action of G on the invariant set of periodic points 
for or to construct a nontrivial homomorphism from G to [];>~_, Z/nZ called the 
gyration function, and used this function to study G. Wagoner [Wa2], in analogy 
with A-theory, constructed a nontrivial representation of G by using its action on 
the space of Markov partitions of Xr. Furthermore, he has shown [Wal] that G 
can be modelled by homeomorphisms commuting with a special diffeomorphism of 
the sphere S% for q > 5. 


Our purpose is to study the algebraic properties of G and the dynamics of its 
action on Xr and some associated spaces. The general method used here to con- 
struct elements of G goes back at least to Hedlund, and is usually called the “marker 
method.” Roughly speaking, this method divides the symbols of a doubly infinite 
sequence into program and data, and the automorphism makes the program act on 
the data. The requirement of keeping program and data separated leads to certain 
complications. Special cases of this idea are used in §2 to show that G contains the 
free group on countably many generators, as well as the direct sum of countably 
many copies of Z and of any countable collection of finite groups. However, G does 
not contain a group with unsolvable word problem. Also, every subgroup of G is 
residually finite, implying G cannot contain a nontrivial divisible group, nor the 
infinite symmetric group. The finite type character of the shifts is related to the 
failure of divisibility in their automorphism groups, for in Example 3.9 we construct 
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a subshift whose automorphism group contains Q with 1 corresponding to the sub- 
shift. Automorphisms constructed using markers have finite order, although their 
composition may not. Conversely, we show that any finite-order element in G is 
obtained from a marker construction by using appropriate coordinates (Proposition 
2.6). 

Divisibility of elements of G is discussed in §3. The main question, which remains 
open, is whether an infinite-order element can have nth roots for infinitely many n. 
An argument using Ryan’s theorem on the center of G shows that aut(o;4)) is not 
algebraically isomorphic to aut(oj2}). However, we are not able to decide whether 
aut(o;3)) and aut(o;9)) are isomorphic. In §4 we present an example of two shifts 
of finite type with equal zeta-functions that have nonisomorphic automorphism 
groups. 

The growth of the set G,(or) of automorphisms depending on the central n 
coordinates is doubly exponential in n. In §5 we define the symmetry of or to be 


s(or) = lim sup = log log |Gn(or)|, 
n—oo N 
show that symmetry is a topological invariant, and prove that sh(or) < s(or) < 
h(or), where h(or) is the topological entropy of or. The precise value of s(ør) is 
not known to us for any T # [1]. 

Krieger has associated to or an automorphism T of a countable ordered abelian 
group (Gr, GŁ) called the dimension group. In §6 we outline this construction, and 
show that each p € G induces an automorphism of (Gr, GF, Ty, The main question 
is whether this dimension group representation 6: aut(or) —> aut(T) is surjective. 
In Theorem 6.8 we prove that if the eigenvalues of T are simple and no ratio of them 
is a root of unity, then for all sufficiently large n the map 6: aut(o%) — aut(T”) 
is surjective. The argument uses the fact that if the eigenvalues of T are simple, 
then aut(T”) is finitely generated. The proof of this has the Dirichlet unit theorem 
as its priipal ingredient. Two examples complement the discussion, one of which 
shows that aut(T) is not always finitely generated. 

The period of a point is not altered under an automorphism, so G acts on the 
set Qn of points with least o7-period n. Is this action transitive? In Theorem 7.2 
we prove a strong form of transitivity on Qn for all sufficiently large n. However, 
we give an example with two fixed points which we show cannot be interchanged by 
any composition of finite-order automorphisms. Our analysis of the action of G on 
periodic points implies certain algebraic properties of G, including that none of G, 
G/(G,G], and G/X are finitely generated. We conclude §7 with an example showing 
that the profinite topology on G does not always coincide with that induced from 
the action of G on periodic points. 

Boyle and Krieger [BK] introduced the gyration function of p € G to be the 
number g(p,or)(n) € Z/nZ indicating the total amount of twist given by ọ to the 
orbits of length n. The map g:G — [|> Z/nZ defined by 


glp) = (9(~,or)(1), glp, or) (2),..-) 


is a homomorphism called the gyration representation. They prove that g(o (x), 01x) 
has infinite order for k > 2. Using computer experimentation, we stumbled on the 
fact that g(or,or)(p”) converges p-adically, usually to a nonzero limit that turns 
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out to be transcendental. This is the basis of our proof in §8 that if the product of 
the nonzero eigenvalues of T is not +1, then g(or,or) has infinite order. It is also 
a key ingredient in our proof that ør is not a limit, in the periodic point topology 
on G, of products of finite-order elements (Proposition 8.3). 

The search for nontrivial representations of G leads naturally to studying G- 
invariant sets and measures on Xr. In Theorem 9.2 we prove that if a point is not 
o7-periodic, then its G-orbit is dense. A modification of this argument shows that 
if Y C Z are o7-invariant compact subsets of ør, then Z is in the G-orbit closure 
of Y under the Hausdorff metric on compact subsets. In §10 we obtain a complete 
characterization of the G-orbit closure of a probability measure on Xr. Roughly 
speaking, a measure v is in the G-orbit closure of u when it has enough cumulative 
entropy to accommodate an approximate image of u. The precise formulation is 
given in Theorem 10.1. One simple consequence is that the measure of maximal 
entropy on Xr is the only continuous G-invariant measure. 

This work has benefitted substantially from numerous conversations with many 
people. We would like in particular to thank Ethan Coven, John Franks, Ralph 
Greenberg, Hang Kim, Bruce Kitchens, Neal Koblitz, Wolfgang Krieger, Gopal 
Prasad, Frank Rhodes, Jonathan Rosenberg, Fred Roush, John Smillie, and Jack 
Wagoner. We would also like to thank the Mathematical Science Research Institute, 
the IBM Thomas J. Watson Research Center, and the National Science Foundation 
for their support. 


2. Markers and subgroups. We shall describe a method for building auto- 
morphisms of G = aut(o7), and use this method to construct subgroups of aut(or) 
isomorphic to such groups as the direct sum of any countable collection of finite 
groups, the free group on infinitely many generators, and the direct sum of count- 
ably many copies of Z. However, we show G does not contain a group with unsolv- 
able word problem, and in the next section that it does not contain a nontrivial 
divisible group. We next discuss some elaborations of this marker method used 
throughout this paper, and show that automorphisms of finite order coincide with 
those obtained from a marker construction by using an appropriate symbolic pre- 
sentation of Xr. 

Recall from §1 that the set of symbols £ for Xr is the collection of edges for the 
graph of T, and that points of Xr are just allowed bi-infinite sequences of symbols. 
For z € Xr let z[m, n] = m''n E L"-™*! be the block of coordinates of zx 
from m to n. Let Bn(Xr) be the set of allowed blocks of symbols of length n for 
T. Put B(Xr) = UP, Bn(Xr). Suppose M E€ Bm(Xr) and that D C Bk(Xr) 
is a collection of blocks with MDM ={MDM: D E D} C Bam+k(Xr) such that 
for every D € D the block M can overlap the concatenation MDM in only the 
initial and final segments of length m (this disallows even partial overlaps at the 
ends). Let m be an arbitrary permutation of D. Define the action of a block map 
Pr on z € Xr as follows. For each 7, if zli, i + 2m + k — 1] = MDM, define 
(prr)li, i +2m+ k- 1] = Mr(D)M. Require y, to have no other action. Because 
the blocks from MDM cannot overlap except for the marker M, this is a well- 
defined o-invariant map of finite order, so y, € aut(ør). The correspondence 
T ++ Pr hence embeds the symmetric group sym D of D into aut(o7r). We shall 
show shortly that |D| can be made arbitrarily large by an appropriate choice of M. 
This implies by Cayley’s theorem that every finite group embeds into aut(or), and 
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generalizes Hedlund’s argument [H, Theorem 6.13] from the automorphism group 
of a full shift to that of a shift of finite type. 

Blocks, or collections of blocks, with the kind of nonoverlapping property used 
above play a role for constructing continuous maps similar to that of Rohlin bases 
used to define measurable isomorphisms in ergodic theory |[Sh, Chapter 10]. The 
idea has surfaced in several guises, such as prefix synchronization codes in informa- 
tion theory [G]. 

DEFINITION 2.1. Two blocks overlap if an initial segment of one coincides with 
a terminal segment of the other. A collection of blocks in B(X7) has only trivial 
overlaps if distinct blocks do not overlap and each block overlaps itself only in the 
entire block. 

The following gives an ample supply of blocks with only trivial overlaps. 


LEMMA 2.2. There is a collection M = Up, Mn C B(Xr) such that Mn 
contains n blocks of equal length, M has only trivial overlaps, and 


MM ={MM': M,M’ € M} c B(Xr). 


PROOF. Since ør is mixing, and T # [1] by our convention, there must be a 
loop totı -ikio E B(Xr) of distinct symbols with k > 1. Furthermore, one of 
these symbols, which we can assume is 2, is followed by a symbol 7; Æ 7. 

First suppose 7; Æ tọ. Choose a path of minimal length from Jı to the loop, 
Say J1J2'''Jris. The case r = 0 is possible and corresponds to J} = ts for some 
s # 0,1. Define A = 29:--t%, B = t071J2°*-Jrts:-th E B(Xr). Forl<q<n 
define Mng = A?BYAB"~%*!. Noting the positions of to in these blocks, it follows 
from the above minimality of paths that M = {Mng: 1 < q < n,n > 1} has only 
trivial overlaps, and that MM C B(Xr) by construction. Since the lengths |Mnql 
are equal for 1 < q < n, the collections Mn = {Mng: 1 < q < n} satisfy the 
conclusions. 

The remaining possibility is for 7; = tọ. In this case let A = tı ---%929, and 
put My, = A772 Aig ?**. Again noting the positions of ig in the Mng shows that 
Mn = {Mnq: 1 < q < n} for n > 1 satisfy the conclusions. O 

We shall say that G = aut(or) contains a group H if there is an isomorphism 
of H to a subgroup of G. Using the markers constructed in Lemma 2.2, we will 
show that G contains several kinds of infinite groups. For clarity, the constructions 
are first carried out on convenient full shifts, then extended to general ør by a 
substitution map. 


THEOREM 2.3. The group aut(or) contains the direct sum of every countable 
collection of finite groups. 


PROOF. We first obtain the embedding when ør is the full 3-shift on {0,1,2}, 
then extend to general ør by a substitution map using markers from Lemma 2.2. 

First suppose Xr = {0,1,2}4, and let D, = {0,1}", M = 2. For 7 € sym(D,) 
define Yr € G using blocks MD,,M = 2D,,2 as above. This yields an embedding of 
sym Dn to a subgroup of Hn of G. Since (prz); = 2 iff z; = 2 and blocks from 2D,,2 
can overlap those from 2Dm2 only in the end symbols when n # m, elements of 
H, commute with those of Hm for n # m. Thus G contains QP, sym Dn, which 
clearly contains the direct sum of every countable collection of finite groups. 
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For general or, use Lemma 2.2 to find three markers Mo, Mı, and M2 of equal 
length with only trivial overlaps, such that M;M,; € B(Xr) for all i,j. If a € 
sym D,, = sym{0,1}", define p, to replace a block of the form M2M,, ---M;,, Mo, 
where 21 °:-% E Dn, with MaM; ++- Mi, Mo, where m(t1...tn) = Jit- Jn, and 
have no other action. The nonoverlapping nature of the M; shows that Øp is 
well-defined, that 7 + P, embeds sym(D,,) into aut(o7), and that the embedded 
subgroups commute. The proof now concludes as in the first case. O 


THEOREM 2.4. The group aut(or) contains the free product of any finite num- 
ber of 2-element groups. Thus it contains the free group on two generators, hence 
the free group on a countable number of generators. 


PROOF. We embed the free product of three copies of Z/2Z, the generalization 
to more copies being routine. We first work on a special full shift, then carry this 
over to a general or. 

Let the alphabet be £L = {0,1,2,3,*}, and og be the full shift on £L. Define 
involutions p; for 7 = 1,2,3 as follows. All will be 2-block maps, and each will 
exchange three pairs of 2-blocks. Specifically, p; exchanges s0 with sj for s € 
£\ {0,7}. Thus each p; uses three markers for its definition, and has the important 
property that markers defining its action are not affected by it. It follows that 
each p; € aut(oz). Let P be the free product of the 2-element groups {e,7} for 
J = 1,2,3. Define a homomorphism from P to aut(oz) by mapping a reduced word 
Ww = Jn: Jı E P to Y = Pint Pj. Since each p? = J, the identity, this is well- 
defined. Consider the point z = ---0000+0000---, with zo = *. Then (W2z)n = Jn, 
(p> Wx) n—1 = Jn—2 and so on. This means that inductively ~ determines the 
spelling of w, so this mapping embeds P into aut(oz). It is elementary group 
theory that P contains the free group F> of two generators [MKS, §1.4], and it is 
known [Ro, Theorem 11.27] that the commutator subgroup of F> is the free group 
on a countable number of generators. 

This idea generalizes to arbitrary or by using markers instead of symbols. If 
Lr is the alphabet for ør, for each a € {0,1, 2,3, *} use Lemma 2.2 to construct a 
marker M, over Lr, all of equal length with only trivial overlaps, and beginning 
with and followed by to. Define involutions y,;, 1 < 7 < 3, to exchange M,Moio 
with M,M,io for s € £\ {0,7}, and have no other effect. Since these markers have 
only trivial overlaps, the p; are well-defined. The argument that they generate the 
free product of three copies of Z/2Z is exactly as before. O 

REMARK 2.5. This theorem shows that G is not amenable. 


THEOREM 2.6. The group aut(or) contains the countable direct sum of copies 
of Z. 


PROOF. We first perform the embedding when ør is the full shift on the alphabet 
L = {0,1,a,6,c}, then generalize. Let Mn = ab"c. Then M = {M,: n > 1} has 
only trivial overlaps. For each n > 1 we will define two involutions ay ,G,, and then 
put Yn = nln. The idea behind this construction is contained in [L2], where it is 
used to construct automorphisms with interesting entropies. Define a, to switch 
MnijMn with MnjiMn, where 2,7 € {0,1}, and to have no other action. Define 6, 
to map riMnjs to J Mni, where i, 7,7, s € {0,1}, and Mn does not move. Declare Bn 
to have no other action. Clearly a, and n are well-defined involutions that do not 
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move any M,, and only affect those symbols 0 and 1 adjacent to the appearances 
of M, and that are not adjacent to markers with subscript distinct from n. If 
Yn = Ann, it follows that the pn commute, but have no other relations. The 
action of pn on the point (M,,00)°(M,,01)(M,,00)© is to shift the block 01 to the 
left by |Mn| +2, proving that pn has infinite order. In fact, each pn has topological 
entropy log 4 [L2]. Thus the subgroup of G generated by the pn is isomorphic to 
the countable direct sum of copies of Z. 

The generalization to arbitrary ør using markers Mo, Mı, Ma, Ms, and Me 
should now be routine. O 

This method allows the embedding of many kinds of countable groups into G. 
But is there a reasonable answer to the following? 


PROBLEM 2.7. Characterize the subgroups of aut(or). 


At least two properties of countable groups prevent them from being embeddable 
into aut(ør). One is the lack of residual finiteness, which we consider in the next 
section. For the other, recall that a finitely presented group is said to have solvable 
word problem if there is an algorithm to decide whether a word in the generators 
represents the identity. There are countable groups without this property [Ro, 
Chapter 12]. We are endebted to Bruce Kitchens for the following observation. 


PROPOSITION 2.8. The group aut(or) contains no finitely generated group with 
unsolvable word problem. 


PROOF. Suppose a subgroup K of G = aut(o) has n generators. The inverse of 
an automorphism is explicitly computable, if only by trying all block maps using 
coordinates from —k to k and increasing k until the inverse is found. Say that p € G 
has range at most mif (pz); depends on only z;_m,...,Zi+m. There is an m so that 
all the generators and their inverses have range at most m. Then a word y of length 
r in the generators and their inverses has range at most rm. As a block map, w is 
explicitly determined by the block maps inducing the generators and their inverses. 
To check whether y = J, it is only necessary to see if Y(£-rm,...,£rm) = Xo for 
all allowed blocks of length 2rm + 1, a finite procedure. Thus K has solvable word 
problem. O 

That a finitely presented subgroup of aut(o7) has solvable word problem follows 
from the residual finiteness of aut(o7), considered in the next section. Proposition 
2.8 is stronger. There exist finitely generated residually finite groups with unsolv- 
able word problem. For these facts see Theorem 4.6 in Chapter 4 of [LS] and the 
remarks that follow. | 

In each of the constructions above we have used a version of the marker method. 
By this we mean that the automorphism permutes certain blocks when they occur 
in the context of certain finite marking patterns, so that the marking patterns 
are not altered by the permutation. A useful point of view is that these marking 
patterns act as “program” on the “data” of blocks to be permuted. Invariance of 
the marking patterns is a reflection of the necessary separation of program from 
data. For the automorphisms p, constructed at the beginning of this section, the 
marking pattern is a pair of M’s separated by k symbols, while the data is the 
collection D of blocks permuted by mr. In more elaborate constructions the marking 
patterns can be quite complicated (see the proofs of Theorem 9.2 and Lemma 10.7), 
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but the automorphisms produced will have finite order. At the other extreme, an 
empty set of marking patterns corresponds to an automorphism that permutes 
symbols. The following result, apparently first observed by John Franks, shows 
that the finite-order elements of aut(o7) are precisely those that are obtained from 
a marker construction on a conjugate shift. 


PROPOSITION 2.9. Suppose p € aut(or) has finite order. There is a shift of fi- 
nite type oy and a conjugacy Y: Xr — Xu so that pp! is a1-block permutation 
of symbols in Xy. 


PROOF. Let M be the partition of Xr into sets {xz E€ Xr: to = a} fora E€ Lr. 
Suppose y* = J, and put P = Vizo pI Py. Note that p permutes the atoms 
of P. Since P has atoms that are compact and open, there is an n > 1 so that 
Vj=-no P refines P. Let Pi = Vj=-no 7P. Then A is a compact open 
partition of Xr refining A, and a standard argument [B1] from symbolic dynamics 
shows that if n is sufficiently large, P} is a 1-step Markov partition for ør with 
transition matrix, say, U indexed by the atoms of Pi. This gives a conjugacy 
p: Xr — Xu. Since p commutes with ør, it will permute the atoms of P. Hence 
wpw—! acts by permuting the symbols of Xy. oO 


3. Residual finiteness and divisibility. In this section we will prove that 
G = aut(or) is residually finite. Since this property is inherited by subgroups, 
it will follow that G does not contain nontrivial divisible groups, nor the infinite 
symmetric group. We then discuss some divisibility properties of G. We conclude 
with a construction of a subshift whose automorphism group contains a copy of the 
rationals. 

Recall [MKS, p. 116] that an abstract group H with identity I is called residu- 
ally finite if the intersection of all its normal subgroups of finite index is {J}. This is 
equivalent to H having enough homomorphisms to finite groups to separate points, 
and also to being able to embed H into a product of finite groups. The profinite 
topology on H is the coarsest making all homomorphisms from H to finite groups 
continuous. Then H is residually finite exactly when the profinite topology on H 
is Hausdorff [MKS, Problem 2.4.24(a)]. Clearly a subgroup of a residually finite 
group is itself residually finite. 


THEOREM 3.1. The group aut(or) is residually finite. 


PROOF. Let Qn = Qn(or) denote the set of points in Xr with least or-period 
n. Since or is mixing, each Qn is finite. An automorphism p € G = aut(o7) is a 
topological conjugacy of or with itself, hence preserves Qn. Thus for each n > 1, 
an automorphism p induces a permutation y|g, in the symmetric group sym Qn. 
Let Kn denote the kernel of the map y+ l|g,,. The Kn are normal in G, and since 
UFZ; Qn is dense in Xy, it follows that N32; Kn = {I}. Hence G is residually 
finite. O 

A group D is divisible if every element has roots of all orders [MKS, §6.2]. A 
consequence of residual finiteness is that complete divisibility cannot occur in G. 


COROLLARY 3.2. The group aut(or) contains no nontrivial divisible groups. 


PROOF. Suppose D is a nontrivial divisible subgroup. Then D is residually 
finite by Theorem 3.1. Let p # I be in D, and N be a normal subgroup of finite 
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index in D with p ¢ N. Put n = |D/N|. Suppose there were a w in D with 
p” = p. Then N = YN)” = W"N = oN, contradicting p € N. O 

To point out the role that finite type plays in Corollary 3.2, we construct in 
Example 3.9 a subshift not of finite type whose automorphism group contains Q. 
This construction can be amplified so the resulting automorphism group is exactly 


Denote by Sæ the group of permutations of the natural numbers fixing all but 
finitely many elements. J. Wagoner has raised the question of whether G contains 
Soo. The following negative answer has also been found, independently, by Kim 
and Roush. 


COROLLARY 3.3. The group aut(or) does not contain Soo. 


PROOF. If A. is the infinite subgroup of Sə consisting of the even permuta- 
tions, then A, is the union of the finite simple alternating groups, so is also simple. 
If Sœ were contained in aut(o7), then by Theorem 3.1 the subgroup A, would 
also be residually finite. But A. is infinite and simple, so is not residually finite. 
B 

Corollary 3.2 shows for example that G does not contain Q or Z(p™) = Z[1/p]/Z 
for primes p. However, the proof does not rule out partial divisibility. 


PROBLEM 3.4. Is Z[1/p| contained in aut(or) for any prime p? 


This amounts to asking whether there is an automorphism of infinite order with 
an infinite chain of pth roots. Indeed, we are unable to decide the following. 


PROBLEM 3.5. Is there an automorphism in aut(or) of infinite order having 
an nth root for infinitely many n? 


Note that if p is such an automorphism, then it cannot be topologically conjugate 
to a mixing shift of finite type. For if p = oy and Y” = p, then y is also a mixing 
shift of finite type ((BK, Lemma 2.5] or [L1, Theorem 8]), hence w is conjugate 
to some oy. Then the spectral radius Ay of U would be a Perron number [L1, 
§1] with nth root Ay, which is also a Perron number. But Ay has only finitely 
many nontrivial factorizations into Perron numbers [L1, Theorem 4], so it has only 
finitely many Perron roots. 

Although we cannot characterize the subgroups of Q contained in G, there is a 
complete answer for subgroups of Q/Z. 


PROPOSITION 3.6. A subgroup of Q/Z is contained in aut(or) iff its p-torsion 
subgroup is finite for every prime p. 


PROOF. Recall that Q/Z = Ð, Z(p) is the primary decomposition of Q/Z 
[Ka, §3]. If H C Q/Z, then H has primary decomposition H = @, Hp with 
H, C Z(p™). If some H, is infinite, then it is Z(p°°), and then this divisible group 
would be contained in G, contradicting Proposition 3.1. If all the H, are finite, 
then H is contained in G by Theorem 2.3. O 

We now turn to examining chains of roots of the identity J. If {n;: 7 > 1}isa 
sequence of integers n; > 1 for all 7, call a prime p good for the sequence if it divides 
at least one, but only finitely many, of the n;. Call p bad if it divides infinitely many 
of the n;. Some primes may be neither good nor bad. We first discuss the case 
when the roots generate a finite subgroup of G. 
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PROPOSITION 3.7. Let (ny: J 2 1} be a sequence with nj > 1. Then there are 
p; E€ aut(or) with po = I, py = pj—ı for j = 1, and which generate a finite 
nontrivial subgroup of aut(or) iff {n;} has a good prime. 


PROOF. First suppose there are p; having the properties mentioned. By skip- 
ping to the first p; # I and adjusting indices, we may assume pı # I. Let p be 
a prime dividing the order o(p1) of pı, and ® denote the group generated by the 
pj. If p were bad, then pı would have a p*th root in ® for every k > 1. This 
contradicts finiteness of ®. Since p | 0(~ 1) and o(p1) | nı, it follows that p is good. 

Conversely, suppose p is good. Choose the largest jo so that p | nji. Let be a 
pth root of J, say constructed using Theorem 2.3. Define p; = I for0 < j < jo—1, 
Yj. = Y, and put p; = YÒ for 7 > jo, where k = (nj,41°--n;)~1 mod p. An easy 
calculation shows these p; work. O 


PROPOSITION 3.8. Suppose (nj: J > 1} is a sequence with n; > 1. Then there 
are p; E aut(or) with po = I, p’ = p;j—ı for j > 1, and which generate an 
infinite subgroup of aut(or) iff {n;} has infinitely many good primes. 


PROOF. First suppose there are p; as described, and let ® denote the subgroup 
of G = aut(or) they generate. Since ® is a union of cyclic groups, it is abelian. 
Suppose p divides o(y,). If p were bad, then every element in ® would have a pth 
root. This would force G to contain Z(p™), contradicting Proposition 3.1. Thus 
every bad prime is relatively prime to every o(y,). If {n;} had only finitely many 
good primes, then there is a Jo so that for 7 > jo each n; is a product of bad primes. 
Thus (o(n;), nj) = 1 for j > jo, so 0(~;41) = 0(9;”) = o(~;) for ïj > Jo, implying 
® is finite. This contradiction proves {n;} has infinitely many good primes. 

Conversely, suppose {n,;} has infinitely many good primes, say pı < p2 < -:: 
Since each p; divides only finitely many nx, by passing to a subsequence we can 
assume there are ni, < Ni, < --- such that p; | ni; for j > 1. Put 79 = 1, and 
let Mmj = ni,_,41---7i,- Clearly it suffices to find a chain of roots for the m; that 
generate an infinite subgroup. 

Since p; is good, for each J there is a d; so that m,---m, # 0 mod po’ for 
all k > 1. Let H; = Z/p#Z, and put H = @%, Hj. Let ajo = pj’? € Hj. 
Inductively we can find a;,% € H}; so that mj440;,4 = ajk—-1. Let bj = ai j—1 + 
a2, j—2+:: +a; € H. Then the b; generate an infinite subgroup of H, and recalling 
that p; | m; we find that m;b; = b;-1. By Theorem 2.6, H embeds into G. If pj 
is the image of b; under this embedding, then the p; satisfy the requirements. O 

In Corollary 3.2 we proved that the automorphism group of a shift of finite 
type cannot contain a divisible group such as Q. If we drop the “finite type” 
hypothesis, divisible subgroups are possible. The following construction yields a 
minimal subshift (X,o) and an embedding of Q into aut(X, 0). 

The details of Example 3.9 are intricate, and the reader may wonder whether a 
more “natural” action of Q would suffice. Unfortunately, because Q is not locally 
compact, most natural actions of Q fail to be expansive, so cannot yield subshifts. 
For example, Q acts on {0,1}, but individual elements of this action are not 
expansive. Expansive maps with roots of arbitrary order are harder to come by. 


EXAMPLE 3.9. A minimal subshift (X,o) and an embedding of Q into aut(X, 0) 
so that 1 E Q corresponds to o € aut(X, 0). 


License or copyright restrictions may apply to redistribution; see http://www.ams.org/journal-terms-of-use 


THE AUTOMORPHISM GROUP OF A SHIFT OF FINITE TYPE 81 


We first sketch how to construct a subshift (X,o) with an nth root. Suppose 
there are n symbols ao, a@1,...,@n—1, and one “spacer” symbol s. Further suppose 
that allowed blocks in X have an-ı always preceded by an s, and ag always followed 
by an s. Define a block map by y(a;) = a;41 for 0 < j < n—2, and p(san-1) = 
aos. In general, p(X) need not be X. However, if X is designed so p(X) = X, then 
p gives an automorphism of (X,o). As the iterates of p act on x € X, different 
parts of z are moved left one position at different iterates, much like the familiar 
slinky toy. The cumulative effect of p” is to move every symbol to the left once, so 
p is an nth root of ø. 

Before giving the detailed construction, let us describe the role of the objects and 
maps obtained. The construction will proceed by stages, starting with an initial 


alphabet £. At stage q > 3 we will have q! words wP, — ua from CL forming 


the set W(9). Each wl will be a concatenation of words from W(1-1) separated 
by 0, 1, or 2 spacer symbols s. The subshift X will consist of all z € LŽ so that 
every subblock of zx is also a subblock of some constructed word. Every word from 
w4-1) will occur in every word from W 9), and s3 will never occur in any word. 
From this it will follow that every allowed block in X occurs syndetically, so (X, oø) 
will be minimal. It will also follow that if z € X and q > 3, then z[—o0, oo] can be 
uniquely decomposed into a concatenation of words from W 9% separated by 0, 1, 
or 2 spacers s. For each q > 3 there will be maps pW) (3 < k < q) defined on words 


in W(9). They will have the properties that (pD) = oy, for 4 < k < q, that 
(y$)§ = ø, and the consistency condition that p{? applied to a word in W(t") 


gives the same result as ptt) would. Thus on X the pio (q > 3) consistently 
define a block map px, and these obey yf = pp-1 (k > 4), y$ = o. Hence mapping 
1/k! to px for k > 3 embeds Q into aut(X,o), with 1 corresponding to ø. 

To begin the construction, let the alphabet be L = {aọ,a1,...,a5,8}. For the 
initial stage q = 3, put wl?) = a; (0 < j < 5), and ws) = {u ; 0O<j7 < 5}. 
Define p?)( w?) = = w®, (0 < j < 4), and pi? (swf) = we) s, This is the 
method outlined in the first paragraph to obtain a 6th root of o, so (p?)6 == 0, 
and we need only make sure that yp?) (xX) =X 


Next we give the first inductive step, to q = 4. Begin by defining for 0 < m < 3 
the 4 words 


3 3 3 
w(t) = wP (swf)? [swsw moa 6) “+ (whs moa 6)| (swh?)*, 
We then obtain the 4! words in W 4) by defining 
wm = (eS) UW) (O<r<5, 0< m3). 


Note that every word in W(°) egy in every word in W(4). Next put py” (w! (2) )= 


wh (0<7 < 4!- 2), and pf *) (swt) J= wh) s. Then let yo = (p (4) 4. Words 


from W4) are cyclically moved by p$ 4) but break into 4 groups of 6 each on which 
os ) acts exactly as p. Thus yp) is consistent with py), and (p (4))4 = p$” by 
construction. Since every word from W(3) occurs in every word from W(4), and s3 


never occurs here, the minimality conditions are satisfied at stage q = 4. 
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Suppose at stage q — 1 we have defined (q — 1)! words wit?) fr 0 < j < 


(q— 1)! — 1 and maps pit) (3 < k < q — 1) so that (l (q4— Dyk = _ pl) a ad 
(3) (q—1) 


the Pk s.. Op are consistent. Construct stage q as follows. First define, for 
0<J<q-l1, 
(q—1)!—1 
=i =) Re —1 1 
wid) = wh! (sw! Ng I] Bs wes (q—1)! (swe? a. 


i=0 
We then obtain the q! words in W‘% by putting 


wm = (OG?) (w) (0<r<(g-1!,0<m<q-1) 
Next define pio by p (w = wit) for0 < j < q!—1, and py? (sw\? ,) = wP s. 
Finally, set 

pto = (pi ale (3<k<q). 


(p) (a) 


Then, as in the q = 4 case, for fixed k each p% (k < p < q) is consistent with p, 


and (p (a) )k = pi. Every word in W(971) occurs in every word of W., and s3 


never occurs. Furthermore, because of the repetitions of swf T” at the ends of the 


wh) every z E€ X has z[—00, co] decomposed uniquely as a concatenation of words 


from W(1) separated by no more than 2 s’s. This completes the construction of 
Stage q. 

The W‘% and p9 constructed obey the conditions described in the second 
paragraph, and we thus obtain the required minimal subshift (X, o) and embedding 
of Q into aut(X, o). O 

REMARK 3.10. In our construction of we. the words from W(971) are cycli- 


cally listed once. However, any arrangement of words from W'%~!), with arbitrary 
repetitions, would also work, provided each word is used at least once. By using a 
long and highly recurrent listing of words from W‘?-"), it is possible to construct a 
uniquely ergodic subshift (X,o) so that aut(X, o) = Q, and such that every Borel 
measurable mapping p: X — X commuting with ø is continuous. This should be 
contrasted with the uniquely ergodic Morse minimal subshift, whose automorphism 
group is just Z@(Z/2Z) with generators the shift and coordinate complementation 


(see [CK, C]). 


4. Nonisomorphic automorphism groups. Conjugate subshifts of finite 
type have isomorphic automorphism groups. Also, clearly aut(or) = aut(o; "Y; 
Since there are shifts of finite type not conjugate to their inverse [PT, Proposition 
3.30], there are nonconjugate shifts of finite type with isomorphic automorphism 
groups. However, the following question remains open. 


QUESTION 4.1. Ifoy is not conjugate to or or oz", can aut(oy) and aut(or) 
be isomorphic as abstract groups? 


In this regard, we know of only one useful invariant, namely the theorem of 
Ryan [Ry1, Ry2] that the center of aut(ør) is precisely the set of powers of or. 
Thus, for example, as an abstract group aut(oj4]) = aut(o/)) has center generated 


by an element with a square root in the group, while aut(o;)) does not since the 
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2-shift has no square root. The following example, a more refined application of 
this idea, shows that the isomorphism class of aut(ør) is not determined by the 
zeta-function. The motivation for the specific matrices used lies in a consequence of 
[B2] that there is only one shift equivalence class over Z for matrices whose nonzero 
spectrum is {1,2}, while there are exactly two such classes for {1,8}, only one of 
which can correspond to the cube of the first class. 


EXAMPLE 4.2. Two mizing shifts of finite type with equal ¢-function and having 
nonisomorphic automorphism groups. 


Let 


er. _|7 6 


< 

lI 
= O O =e 
or = © 
pi m & © 
O O m Fe 


Then ¢r(t) = su(t) = [(1 — 8t)(1 — t)]. Clearly the generator ør for the center 
of aut(o7r) has a cube root oy. Suppose oy has a cube root p in aut(oy). Then 
p is again a shift of finite type ([BK, Lemma 2.5] or [L1, Theorem 8]), say p = 
ow. Since oy = (ow)? = ows, the nonzero spectrum of W is {1,2} counting 


multiplicity. Hence by [B2], there is an integer j so that W is shift equivalent over 


Z to bel = W,. Since 
W, = 1 0 2 0 2 0||1 Of _ {2 0 
a a ee a oe ale al toi 


and 
we would have 


shift equivalent over Z. Thus there would be matrices R and S over Z and a positive 
integer l with RS = M', SR = N!, NR = RM, and SN = MS. Now NR = RM 
forces Rig = 0 and 7R21 = —Ro2. Also, SN = MS forces Sis = 0. Then RS = M! 
implies R228522 = 1, so Reg = +1. This contradicts 7Ro; =—Ro. O 


5. Symmetry. Let ør be a mixing shift of finite type. As in the proof of 
Proposition 2.8, say that p € G = aut(or) has range at most n if (px); depends 
only on Ti-n,...,Zi+n. A natural measure of the symmetry of ør is the rate of 
growth of the subset Gn (or) of those automorphisms in G with range at most [n/2]. 
This growth turns out to be doubly exponential, so we define the symmetry of or 
as 

s(or) = limsup a log log |Gn (0r )|. 
n—oo 1 

The definition of symmetry depends on a particular presentation of or, but 
the following proves that symmetry is a conjugacy invariant. Suppose ør and 
Oy are conjugate via a mapping ~:X7; — Xy. Choose m so that both y and 
T! have range at most m. If p E€ Gn(or), then poy! € Gni4am(ou), so that 
IGn+4m(ou)| > |Gn(or)|. Thus s(oy) > s(or), and by symmetry they coincide. 
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THEOREM 5.1. Ifo is a mizing shift of finite type, then 


sh(a) < s(a) < h(o). 


PROOF. We first establish the upper bound. Represent o as or with r states, 
and put X = Xr. Now |Gn(or)| is trivially bounded above by the total number 
of block maps By,41(X) — {0,...,7 — 1}, or r!8n+:@Ol, If h(or) = log A, where A 
is the dominant eigenvalue for T, there is a x > 0 so that |Bn41(X)| < kà”. Thus 
IGn(o)| <r", so that s(o) < log à = A(o). 

The lower bound uses the automorphisms constructed at the beginning of §2. 
For this we first need to show that the number of marker-free blocks of length m 
between two markers grows faster than (\!~£)™, where € can be made small by 
choosing long enough markers. 

Fix a small € > 0 and a marker M € B,(X), where n > no with no to be 
determined. Let l be a transition length for T, so that T! > 0. There is an a > 0 so 
that |B,(X)| > aà" for k > 1. If k <n, then M has n — k +1 subblocks of length 
k. Hence if k is chosen so that aA* > n — k + 1, there will be a block C € B,(X) 
that does not appear in M. If no is large enough, a choice of k < $n — l is possible. 

Consider blocks MDM € By (X) of the form 


MECB,CB.---CBxCFM, 


where B; E Bn-2k(X), N = (n — k)K + 2n + 2l + k, and E, F are fixed transition 
blocks in B;(X). Since l is a transition length, every block of Bn-2k-21(X) can be 
the central part of each B;. Hence the collection D of blocks D with the required 
form has cardinality 


ID| > (a7 -2k-21)* > (axes). 


Since 2|C| + |B,| = n, a subblock of D € D with length n must contain C. Now 
M does not contain C, and has only trivial self-overlap. Thus M can only occur 
in MDM as the initial or terminal segment. As at the beginning of §2, distinct 
permutations m € sym D determine distinct automorphisms Yy € aut(o7r) whose 
range is clearly at most N. Thus 


IGen4i(or)| > )(aatt-en) l. 


Now Stirling’s formula implies that log m! > m(logm — 1), which applied to the 
above yields 


log log |G2N+1 | 


1 
See Wn OK Al) Fakti 


slor) > limsu 


log A 


1 — 
log À = hor). 


Since € > 0 was arbitrary, the lower estimate is proved. O 
We do not know the exact value of s(o7) for any T # [1], nor whether the 
definition’s lim sup is actually a limit. 
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PROBLEM 5.2. Compute s(or). 


For full shifts o;,) this problem can be viewed as a quantitative inquiry into the 
relative sparseness of aut(o/,)) in the semigroup of all block maps from Xq) to itself. 
A topological measure of this sparseness has been given by Sears [Se]. We remark 
that for oj}, if we replace “automorphism” by “surjective map” in the definition 
of symmetry, the value is easily shown to be logr = h(oj,)). For any block map 
involving k symbols can be used to define a right permutive block map using k + 1 
symbols by adding modulo r the value of the block map on the initial k symbols to 
the last coordinate. Such right permutive maps are surjective [H, Theorem 6.6]. 

NOTE ADDED IN PROOF. H. Kim and F. Roush have shown that s(or) = h(or) 
for all irreducible T. Their solution to Problem 5.2 uses a modification of the 
construction used to prove Theorem 5.1. 

Let ur be the measure of maximal entropy for or. Coven and Paul [CP] proved 
that ur is y-invariant for every p € G. Since G is not amenable (Remark 2.5), 
the sets G, may provide a replacement for averaging sets for the action of G on 
(X T3: Lr) ° 


QUESTION 5.3. If f € C(Xr), does |G,,|~* 2- peG, T (pT) converge to 


f dur 
Xr 


in any reasonable sense for most or all z E€ Xr? 


6. Induced action on the dimension group. For this section only, we drop 
our standing assumption that T be aperiodic, and assume only that T is nonnegative 
integral and not nilpotent. Building on the fundamental work of Williams [Wi] 
and Elliot [E], Krieger [Kr2] associated to each shift of finite type ør an order- 


preserving automorphism T of an ordered group (Gr, Gp) called the dimension 


group of or. The triple (Gr, G7, T) is a topological invariant since it can be 
defined using only topological notions. Two shifts of finite type are shift equivalent 
exactly when their dimension triples are isomorphic [Kr2]. 

We first review an algebraic description of the dimension triple, and indicate its 
relationship to underlying topological notions. Next we show how an automorphism 
p of or induces an automorphism ó (p) of the dimension triple. The basic problem is 
to determine whether 6 is surjective. We do not settle this. We show in Theorem 6.8 
that if the nonzero eigenvalues of T are simple, and if the ratio of distinct eigenvalues 
is not a root of unity, then for all sufficiently large n the map 6: aut(o#) —> aut(T) 
is surjective. We also show that these hypotheses on the eigenvalues of T imply 


~ 


aut(T) = aut(T”) for all n. Example 6.7 shows that in general aut(T?) can be 
larger than aut(T), though still finitely generated. Also, using an idea suggested to 
us by Gopal Prasad, in Example 6.9 we exhibit a T for which aut(T) is not finitely 
generated. 

Suppose T is an r x r nonnegative integral matrix. For the moment we drop our 
standing assumption that T be aperiodic. It will be convenient to have matrices act 
on the right, and for vectors to be row vectors. Say that v € Q” is eventually integral 
(under T) if vT” € Z” for large enough n. Call two eventually integral vectors v 
and w equivalent if vT” = wT” for large enough n. The set Gr of equivalence 
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classes [v] of eventually integral vectors v inherits an additive group structure from 
Q”. The positive cone GZ is the set of [v] for the vT” > 0 eventually. 

Let R = T"Q” be the eventual range of T. Each class in Gr has a unique 
representative in R, so it will sometimes be convenient to regard Gr as embedded 
in R. Define T on Gr by T([v]) = [vT]. By considering Gr as a subgroup of R, 
the extension of T to R is invertible, so T defines an automorphism of Gr. It 
is clear that T is order-preserving. By the dimension triple of or we shall mean 
(Ceo) 

We now sketch Krieger’s topological construction of the dimension triple. For a 
detailed introduction to these ideas, see [BMT, Chapters 2, 5, 11]. 

By an n-ray we shall mean a set of the form 


t(—oo,n]* = {y E Xr: yj = z; for ~œ < J < n}, 


where x € Xr. By an n-beam we mean a finite disjoint union of n-rays. Note 
that although the notion of n-ray is tied to the presentation T, that of n-beam is 
not, since an n-beam can be topologically described as a finite union of compact 
open subsets of unstable sets in Xr in the natural inductive limit topology. If 
C=U, t)(—oo,n]* is an n-beam, define a vector vC,n € Z” whose ith entry is the 
number of n-rays z such that z?, has terminal state 7. If C is an n-beam, then 
it is also an m-beam for m > n, and vc.m = Uc nT™ ”. Define beams C and D 
to be equivalent if for large enough k we have vc k = vp k. The set of equivalence 
classes generate the positive cone G* of an ordered group J using the definition 
[C] + [D] = [C U D] if CN D = ©. Now ør acts on beams, preserves equivalence, 
so induces an order-preserving automorphism r of G. Since equivalence of beams 
corresponds to equivalence of eventually integral vectors, it is routine to verify that 
the map sending the class [C] of an n-beam to T~"([vc.n]) is an isomorphism of 


(G,G+,@r) to the dimension triple (Gr, GF, T) defined above. 

Therefore, to prove that the dimension triple is a topological invariant, it suffices 
to prove that if ~:or — oy is a topological conjugacy, then ~ maps beams to 
beams, respects equivalence of beams, and intertwines r with oy. The routine 
verifications are omitted. However, we add one note of caution. If Y: Xr; — Xy is 
merely surjective, it is not necessarily true that ~ maps beams to beams. 

We shall denote the group of order-preserving automorphisms of the dimen- 
sion triple by aut(T). Suppose p € aut(or). Since p is a self-conjugacy of or, 
the argument above about topological invariance proves that » induces an au- 


~ 


tomorphism 6(y) € aut(T). A routine calculation shows (py) = 6(~)d(w), so 


6:aut(or) > aut(T) is a homomorphism. 


QUESTION 6.1. Is the dimension representation 6: aut(or) — aut(T) always 
surjective? 
If U is an integral matrix commuting with T, we define U on Gr by U([v}) = [vU]. 


LEMMA 6.2. Suppose U and V are nonnegative integral matrices so that T = 
UV = VU is an elementary strong shift equivalence of T to itself. Then there is a 
p E€ aut(or) with 6(y) =U. 


PROOF. This is mainly a matter of checking that the induced automorphism 
defined by Williams [Wi] acts correctly on beams. 
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Suppose T is r xr. Let W = [76], so W? = [4 7], indexed by states 
{0,1,...,27 — 1}. Then Bo(ow) is the disjoint union of the set By y of paths 
oe and ending in {0,...,7 — 1} and the set By y beginning and ending in 
{r,...,2r — 1}. Thus Xywz is the disjoint union of Xy y defined from By y and 
on defined from By y. Note that ow switches these sets. From the form of W?, 
it follows that there are bijections ĝy v: Bi(or) > Buy and by.y: Bi(or) > By 
that respect initial and terminal states mod r. These induce 1-block conjugacies, 
denoted by the same symbol, 0y,y:(Xr,0r) — (Xu.v,o#,) and 0y u: (Xr,or) > 
(Xv.v,07,). Let p = 07 vow 8uv € aut(or). 

We compute the action of p on a left-infinite ray z(—oo,n]*. Suppose £n has 
terminal state 2. Then #y,y maps this to a ray in Xy,y ending with state t. Next 
ow maps this to a union of rays, with U;; of them ending in state j. Finally, 
Ozu maps each ray ending in state J to one in Xr ending in state 7. Passing to 
tee action of p on Gr, we see the standard unit vector e; is mapped by (p) to 
2 j=1 Uijej. Thus (p) = U. o 

Note that aut(or) is naturally a subgroup of aut(o7%). Furthermore, the re- 
striction of 6: aut(o%) — aut(T”) to aut(ør) coincides with the definition of 6 of 
aut(or). 


PROPOSITION 6.3. Suppose ® € aut(T). Then for all sufficiently large n, there 
is a p E aut(on) with 6(p) = ®. 


PROOF. Let T be r xr acting on Q”. Then R = T’Q” is the eventual range 
of T, and K = ker T” is its eventual kernel. Hence Q” = R@K. As before, we 
may consider Gr as embedded in R. Since Gr is torsion-free, the automorphism ® 
extends to a nonsingular Q-linear map W of Gr ®Q = R. If Ox denotes the zero 
map on K, and Up is the matrix for 0x W with respect to the standard basis, then 
Uo = 0g OW has rational entries, and Uo = ®. Let Vy = 0x @W!. Both Uo and 
Vo have rows in Oe, so for k and n — k large enough, U = UpT* and V = VoT”-* 
have Zt entries. Thus UV = (0x ®Ir)T” = T” = VU is an elementary strong shift 
equivalence of T” to itself. By Lemma 6.1, there is a y €  aut(o7) with 6(p) =U. 
Put p = poz". Then 5(y) = 6(W)6(op*) = UoT*T-* =U =%. o 

To use Proposition 6.3, we will establish some results about finite generation of 
aut(T). In what follows, aut ( Git) refers to the group of those automorphisms 
of Gr commuting with T, not necessarily preserving the positive cone. If U has 
nonzero eigenvalues 41,...,Ax, let xý (t) denote jst — àj). 


LEMMA 6.4. Suppose x7, is irreducible. Then aut(Gy,U) 1s finitely generated 
and abelian. 


PROOF. By passing to the eventual range of U, we may suppose U is nonsingular. 
Let U ber xr, and let A be an eigenvalue of U. By a theorem of Taussky [T], there 
is an ideal J C Z[A] so that (Z”, U) = (J, My), where M) denotes multiplication by 
A. Hence (Gy, U) = (Z[1/A]J, My). For more on this correspondence, see (BMT, 
Chapter 5]. Under this isomorphism, an automorphism © of U corresponds to an 
automorphism of Z[{1/A]J commuting with M), i.e. a Z[{1/A]-module isomorphism. 
Since the quotient field of Z[1/A] is Q(A), there is an a € Q(A) so that ® corresponds 
to the restriction of Ma on Z{1/A]J. Let S be the set of prime divisors of the ideal 
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generated by À, and Os denote the ring of S-integral elements in Q(A). Since both 
Ma and M,-: are automorphisms of Z[1/A]J, it follows that æ is in the unit group 
O of Os. By the Dirichlet unit theorem [We, Theorem 5-3-10], the group Ô% of 
units is a finitely generated abelian group. Thus aut(U ) corresponds to a subgroup 
of Ô% , so is also finitely generated and abelian. O 

REMARK. The Dirichlet unit theorem shows that aut(Gy,U) is the product of 
a finite cyclic group and a free abelian group on e +r + s —1 generators, where 
Q(A) has r real and 2s complex embeddings, and the factorization of the principal 
ideal generated by A uses e distinct primes. 


PROPOSITION 6.5. Ifall the nonzero eigenvalues of T are simple, then aut(T) 
1s finitely generated and abelian. 


PROOF. Factor yr(t) = t™pi(t)---pe(t), where the p,(t) are distinct irre- 
ducibles. If T is r xr, then Q” is the direct sum of the eventual kernel of T and the 
rational subspaces R; corresponding to the p;(t). Let 6; = GrOR;, and T, = T| Gx 
Suppose ® € aut(T). As before, ® extends to a Q-linear map of Q” which is invert- 
ible on <>a R;. Since the p;(t) are distinct and irreducible, each R; is ®-invariant. 
It follows ®|g, is an automorphism of (9;,T T;). Hence the map ® > Or Pje, 
takes aut(Gr,T) to pi- , aut( G,, T. ry. Since pj- , Q; has finite index in Gr, this 
mapping is injective. By using an integral basis for Z” N Rj, the map T; is seen to 
be a dimension group automorphism, so aut(G,, T;) i is finitely generated abelian by 


Lemma 6.4. This proves aut( Gr, T) is finitely generated abelian. The group aut(T) 
of order-preserving automorphisms is therefore also finitely generated abelian. O 


LEMMA 6.6. Suppose the nonzero eigenvalues of T are simple. If T does not 
have distinct eigenvalues whose ratio is an nth root of unity, then aut(T”) = aut(T). 


PROOF. As noted above, aut (T) C aut(T”) is trivial. 

Using the notations from the proof of Proposition 6.5, we see the hypotheses on 
the eigenvalues of T mean that the spectra of T” |r, are disjoint for 1 < J < k. 
Thus any 9 € aut(T”) extends to a Q-linear map under which the R; are invariant. 
Since the eigenvalues of T” on R; are distinct, and ® commutes with T” on R}, it 
follows by linear algebra that ® commutes with T on R;, completing the proof. O 

EXAMPLE 6.7. An aperiodic matriz T for which aut(T?) is larger than aut(T), 
but both are finitely generated. 


Let 

o 2 2 

T= |4 1 4 

0 6 3 

Then over Z[{1/3] we find 

9 0 O 1 1 0 
VUTV=U=j0 3 O|, whereV=/1 0 1 
0 0 -3 P =] =] 


Since the eigenvalues of U are distinct, we obtain Gy = Z[1/3]°, and aut(Gy,U) = 
GL(1,2Z[1/3])* = [Z @ (Z/2Z)]*. Since order-preserving automorphisms of Gr just 
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need to preserve the positive dominant eigendirection, aut( Gr, GŁ, T) is a subgroup 
of index 2 in aut( Ga. 7), The conjugacy of T and U takes place over Z[{1/3], so 
aut(Gu,U) = aut(Gr, T). Hence aut(T) = Z? @ (Z/2Z)?. 

On the other hand, U? has a repeated eigenvalue of 37, and any automorphism 
of the corresponding subgroup of Gy extends to one of Gy. Thus 


aut(Gy,U?) = GL(1,Z[1/3]) @ GL(2, Z[1/3}). 
As before, we conclude aut(T?) = Z ® GL(2,Z[1/3]), which is larger than 


aut(T). D 
We remark that by using elementary matrix operations, one can show that 
GL(2, Z[1/3]), and hence aut(T?), is a finitely generated nonabelian group. 


Assembling the pieces, we now state the main result of this section. 


THEOREM 6.8. Suppose the nonzero eigenvalues of T are simple. If no ratio 
of distinct eigenvalues is a root of unity, then for all sufficiently large n we have 
that 6: aut(o}) > aut(T”) = aut(T) is surjective. If some ratios of eigenvalues are 
roots of unity, the conclusion still holds for infinitely many n. 


PROOF. Let us first suppose no ratio of distinct eigenvalues is a root of unity. 
By Lemma 6.6, we have aut(T”) = aut(T). By Proposition 6.5, aut(T) is finitely 
generated. Using Proposition 6.3, for all sufficiently large n, each generator is in 
6(aut(o7#)), implying the conclusion. 

If some ratios of distinct eigenvalues are roots of unity, the preceding argument 
still holds for all n relatively prime to the orders of these roots of unity. O 

We conclude with an example to show that finite generation of aut(T) does not 
generally hold. 

EXAMPLE 6.9. An aperiodic matriz T with aut(T) not finitely generated. 

We are grateful to Gopal Prasad for the idea behind the following. Let 


oð 2 2 
T=|4 4 1 
0 3 6 


Then T is conjugate over Z[1/3] to 


9 0 0 1 1 0 
U=V'Tv=1|0 3 O|, where V = |1 0 1 
0 3 3 Lal =] 


As in the argument for Example 6.7, aut(Gy) = GL(1, Z[1/3]) x H, where H is 
the group of automorphisms of Z[1 137 commuting with W = & $]. By analogy 
with the KAN decomposition of Lie groups, it is easy to see that HS KxAxN, 


where K = {+1}, A= {3"I: n€ Z}, and 
N= if 4 aezi/s}}. 


Now N is isomorphic to the additive group Z[1/3], which is not finitely generated. 


Thus H is not finitely generated, so aut(T) = Z x H is also not finitely generated. 
O 

The following question appears to us basic to understanding the structure of 
aut(or). 
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QUESTION 6.10. JfT 1s aperiodic, 1s the kernel of the dimension representation 
of aut(or) generated by elements of finite order? 


This question is especially pertinent to understanding the action of aut(o7) on 
the periodic points of ør (see §7 and [BK]). It is a natural generalization of an 
older one, which to the best of our knowledge was first explicitly conjectured by F. 
Rhodes in correspondence with G. Hedlund. 


CONJECTURE 6.11 (F. RHODES). The automorphism group of the 2-shift is 
generated by the 2-shift and involutions. 


7. Induced action on periodic points. Each automorphism of ør maps a 
periodic point to another with the same period. This action of G on periodic points 
was studied by Boyle and Krieger [BK]. We begin by discussing a fundamental 
question about this action, and give a partial answer in Theorem 7.2. We then 
discuss several representations of G provided via periodic points, and conclude 
with remarks about some related topologies on G. 

Let Pa = Pn(or) denote the fixed points of o}, and Qn = Qn(or) be those 
points with least or-period n. Since Qn is or-invariant, we can define aut(Qn, or) 
to be the group of bijections of Q, commuting with or. Let Fn = U=: Q;. 


QUESTION 7.1. When is an automorphism in aut(F,,o07) the restriction of 
one in aut(X7,o7)? 


This question is the natural generalization of a long-standing problem of R. F. 
Williams, namely when can fixed points be switched by an automorphism. The 
latter problem, posed originally as a potential refinement of shift equivalence, has 
withstood serious attacks for a number of years. Example 7.3 below provides a 
concrete case of a shift of finite type having two fixed points which no composition 
of finite-order elements can switch. It cannot be ruled out at present that the 
answer to Question 7.1 is “always”. 


THEOREM 7.2. Let or be a mizing shift of finite type. There is an no(T) so 
that ifn > no and z,y E Qn(or) have disjoint orbits, there is a composition p of 
involutions in aut(or) so that px = y, py = z, and p fizes all points whose orbit 
has length < n and does not contain =z or y. 


PROOF. The proof elaborates ideas from [BK], where this is proved for full 
shifts with np = 1. Let us first describe some convenient notation. Shorten ør to 
O, XT to X, Bn( XT) to Baz and B(Xr) to 8. If A = Q0 '''’'an-1 € Bn, let Alt, J] 
denote a;---a;. If A? E Bon, let A® denote the point x € X with z; = Afi mod n]. 
If 0 <i <n, let o°(A) be the cyclic permutation Afi, n — 1]A[0,z — 1] of A. Then 
o*(A®) = (o A)”. 

First suppose A and B are distinct blocks in B, with A? B? A? € B. Although 
(AB)© need not have least period 2n, we claim there are cyclic permutations 
o*(A), o°(B) so that (o*(A)o*(B))~ € Qn. To prove this, suppose (AB)® has 
least period 2n/m < 2n, so AB = C™ with m > 3 and odd since A # B. Thus 
|C| is even, so write C = DE with |D| = |E| and D # E. Thus A = D(ED)? and 
B = E(DE)? with q = ¿(m — 1). Let A; = o!Pl(A) and Bı = o!?!(B), and put 
F = A,B, = (ED)9"!EDDDE(DE)?—'E. Suppose F® has period k < 2n. Since 
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D # E, we conclude k < n, hence 

k < m = atip < 2q|D|. 
Now choose 7 > 1 such that |D| < 7k < 2q|D|, and translate the central DDD 
block in F by 7k to the left. It follows that this block contains either DE or ED 
as a subblock, implying D = E, a contradiction. Hence F® € Qon, verifying 
our claim. Note that the cyclic permutations A; and B; retain the property that 
A? B? A? € B, so AS, B®, and (A;B,)© are allowed points in X. 

If z,y E Qn, say that p € G switches x and y if pr = y, py = z, and ọ fixes 
all points whose o-orbit has length < n and does not contain z or y. We shall 
write in this case p: zey. Suppose now that A,B € Bn with A?, B? €e B and 
A”, B® € Qn defining disjoint orbits with Afi] = Bfi] for some 0 < i < n. We will 
construct an involution pap: A® B®. First replace A by o*A and B by o*B, so 
now A? B? A? € B. By the above, we can further replace A by o/ A and B by o/B 
so that still A? B? A? € B and also (AB)® € Qan. To define pap, let a frame be a 
word in {A, B}°. If z[i — 2n,7 + 3n — 1] is a frame, define 
B ifzli, +n- 1] = 4A, 

A ifz[i,i +n- 1] = B, 

and require pap to have no other effect. To see that pap is well-defined, suppose 
the contrary. Then there are two frames C and D with C[—2n,3n — 1 — k| = 
D|—2n + k,3n — 1] for some k with 0 < k < n. Since A® €E Qn, it follows that 
A occurs in AA only in the initial and terminal halves, and also B does not occur 
in AA. Thus C and D have the form ABABA or BABAB. The overlapping of C 
and D force (AB)® to have period < 2n, contradicting (AB)® € Qan. Thus gap 
is well-defined. Clearly it preserves frames. It follows that p4pB is an involution in 
G, and obviously it switches A® with B®. 

To complete the proof, let l > 0 be a transition length for ør, so that (T');; > 0 
for allz,7 € L. There is an no = no(T) so that for all a,b E L and n > no, there are 
at least 3n blocks D in Bn satisfying D[0] = a, D|I] = b, and D® € Qn. This follows 
by counting, since the number of n-blocks satisfying the first two conditions is at 
least xà% for suitable x > 0, while the number of n-blocks D with D® having period 
< n is no more than } ajn, a<n |Bal = O(A7/°). Now fix n > no(T), and suppose 
z,y E Qn have distinct orbits. Put A = z[0,n — 1] and B = y[0,n — 1]. By the 
choice of no, there is C € B, with C[0] = A[0], C[/] = B[l], C® € Qn, and the orbit 
of C® missing z and y. By the above, pac: AV” =CÙ® and yac: BY=C%®. Then 
p = pacpBgcpac: A=B”, and no other orbits of length < n are affected. O 

Theorem 7.2, with a smaller estimate for no(T), has been proved independently 
by M. Nasu [N]. 

The following example shows it is possible to have periodic orbits that cannot 
be switched by products of finite-order automorphisms. 


EXAMPLE 7.3. A mizing or with exactly two fized points that are also fixed by 
every finite-order automorphism of or. 


Let 


papBlz)li i +n- 1] = l 


1 1 0 
T= |0 0 1ļ, 
3 0 1 
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and Q:(or) = {z,y}, where z = 0% and y = 2”. Suppose  € G with yë = I 
and y: zy. Now (wor)* = of, so yor is a root of a mixing shift of finite type, 
hence itself is a mixing shift of finite type ([L1, Theorem 8] or [BK, Lemma 2.5]). 
Since the characteristic polynomial of T is irreducible over Q, it follows that or 
and wor have equal ¢-functions. But z E Qe(wor), while Q2(or) is empty. This 
contradiction shows such a w cannot exist. Furthermore, ør has no roots. For 
suppose w* = or. Then as before 7 is a shift of finite type, say y = oy. But then 
the product of the nonzero eigenvalues of U is 3!/*, contradicting integrality of this 
product. In this example, no combination of marker constructions and roots of the 
shift can switch z and y, and it remains open whether this is possible. O 

Recall that Pa = Pa (or) denotes the fixed points of o}, and that Qn = Qn(or) 
is the set of points with least o7-period n. Clearly each p € G restricts to a 
permutation m(p) = plo, E aut(Qn,or). If Qn = Ø, set aut(Qn,or) to the 
trivial group, so then mpn is also trivial. Call 


Oo 
T(P) = (™1(), t2(~),---) € || aut(Qn,o7) 
n=l 
the periodic point representation of p € G. The proof of Theorem 3.1 shows that 
it is faithful. 


PROPOSITION 7.4. The periodic point representation t of G is faithful. O 


The next two representations are derived from 7. Let E = {o}: n € Z}, and 
On = Qn/= be the set of orbits of length n. If p € G, then z,(y~) commutes with 
(ao), so induces a permutation pn(p) € aut(On, or). Call 


OO 
ple) = (mle) plp). --) € [atn or) 
n=1 
the periodic orbit representation of p € G, where again we use the convention that 
the symmetric group of the empty set is the trivial group. Clearly p(o*) = I, but 
» exhausts the kernel of p. 


PROPOSITION 7.5. The periodic orbit representation p of G is faithful on G/X. 


PROOF. Suppose pn(p) = I for n > 1. By Theorem 2.5 of [BK], it follows that 
ypeb. OQ 

We next prove that the range of p is large, in that its closure in the compact 
group [],_, aut(Qn,or) is a subgroup of finite index. 


THEOREM 7.6. The closure of p(G) in[]?-, aut(Qn,o7) is a subgroup of finite 
index. 


PROOF. Let no = no(T) from Theorem 7.2. We assume without loss that 
|On| > 1 for n > no. Suppose n > no, and let 6; € aut(Q;,or) for 1 < j <n with 
6, =I if 1 <j < no. We show there is a p E€ G with p;(~) = 0;, (1 < J < n). It 
will follow that p(G) is a finite union of cosets of Ma aut(Qn,or). Set pı = I, 
and suppose inductively that y; € G has been found with p;(p;) = 6; fori <i <J. 
Since p;41(~,)7 10,41 € aut(Q,41,07) is a product of transpositions, by Theorem 
7.2 there is a product ~;+41 of involutions in G with p;(%;41) =T for 1 <i < 7 and 
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Pj+i(Vj+1) = 0541(7)~*O541- Set 541 = 9;H;41 and continue. Then p = pn 
has the required properties. O 

One implication of this density result is another proof of Ryan’s theorem [Ry2] 
on the center of G. This proof was obtained jointly with W. Krieger. 


THEOREM 7.7 (RYAN). The center of G is ©. 


PROOF. Choose nı > no(T) so that if n > nı, then |Q,| > 3. If p commutes 
with every element in G, then by Theorem 4.5 its restriction p,(y) to orbits com- 
mutes with all of aut(Q,,o7r) for n > nı. Hence pnp) = I for n > nj, so by 
Theorem 2.5 in [BK] it follows that pE X. D 


THEOREM 7.8. The groups G, G/X, and G/[G,G] are not finitely generated. 


PROOF. We show each factors onto an arbitrarily large product of two-element 
groups, which shows each is not finitely generated. By Theorem 4.5 there is an 
nı > no(T) so that for n > nı we have |Q,| > 2 and that (pn,,...,0n) maps G 
onto [[;-n, aut(Q;,or). Denoting the sign of a permutation p by sgnp € {+1}, 
it follows that (sgn pn,,...,SgNn pn) maps G onto {+1}"~"'!. This mapping factors 
through G — G/X and G > G/[G,G] , completing the proof. O 

We now recall the gyration function g introduced in [BK]. To define g, let p E€ G, 
and from each orbit y E Qn pick an element z4 € y. Since p(x) € p(y), there is 
an integer n(7, p) defined modulo n so that p(z4) = oP) a... Put g(y)(n) = 
g(p,0)(n) = doco, 2(7,9), where by convention an empty sum is 0. The value 
of g(y)(n) is shown in [BK] to be independent of the choice of the z}, and also 


g(ev)(n) = g()(n) + g(4)(n) mod n. Call 


glp) = (9()(1), 9()(2),.-. JE Tl Z/nZ 


n=l 


the gyration function of p € G, and call the homomorphism p + g(p) the gyration 
representation. This function is studied in detail by Boyle and Krieger [BK] 

The periodic orbit and gyration representations are related. If 6 € aut(Qn,or) 
commutes with 7,(or) = olg,, its gyration number g(0) € Z/nZ is defined as 
above. Also, let p(@) be the permutation induced by 0 on the orbit space Qn = 
Qn/x. Call a sequence (61,...,9n) with 6; E aut(Q;,o7r) consistent if whenever 
2™q4 < N with q odd, then 


(Goma) 0 mod 2™q if Weer , sgn p(Ooi,) = 1, 
27q) = 
: 2”-1q mod 2™q if Wey sgn p(92;,) = —1. 


In particular, if m = 0 the empty product being 1 means g(#,) = 0. If or is 
a full shift, it is shown in [BK] that a necessary and sufficient condition for a 
sequence (01,...,0,) with 0; € aut(Q,(or),or) to be the restriction 0; = 7;(y) of 
a product p of involutions is that it be consistent. Necessity was also shown for a 
broad class of or. One consequence is that for a product p of involutions, sgn p(y) 
and g(y) determine each other. This is significant since many marker constructions 
are involutions. We show a modified form of the sufficiency condition is true for 
general shifts of finite type. 
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THEOREM 7.9. Let or be a mizing shift of finite type, and fix N > no(T) with 
no(T) as in Theorem 7.2. If 0; E€ aut(Q;(or),or) forl <j <N with 6; =I if 
1 < j < no, and if (01,...,An) ts consistent, then there is a product p of involutions 
with m;(p) =0; fori <j <N. 


PROOF. We first observe that the involution paps in the proof of Theorem 7.2 
can be constructed so that (mı(paB), 7. tN (paAB)) is consistent. Let k = |A|, and 
change the definition of “frame” in the construction from {A, B}° to {A, BYN+1, 
The only points of period < N affected by pap have period a multiple of k, and 
are concatenations of A’s and B’s. Let oz be the 2-shift on {0,1}, and 7 € aut(o2) 
be the 1-block map exchanging 0 and 1. Mapping A — 0 and B — 1 shows that 
Pnk (PAB) E€ aut(Qng(or),or) and pn(7T) € aut(Qn(o2),02) have the same sign. 
Furthermore, g(paB,or)(2nk) = kg(t,02)(2n) mod 2nk. Since pm(yas) = I if 
m # Omod k, and (7(7),...,7w(7)) is consistent from Lemma 3.3 of [BK], it 
follows that (71(~4B),-.-,7N(~4B)) is also consistent. 

To construct p, assume inductively that Ym has been found with 7;(ym) = 9; 
for 1 < j < m and so that (71(m),--.-,7N(m)) is consistent. As in the proof 
of Lemma 3.7 of [BK], there is a product Ym of involutions of the above type 
fixing points with period < m such that tm+41(Wm+1) = m+1(¥7,'9m41) if the 
gyration number of 710,41 vanishes. However, this follows from consistency of 
the 0; =7;(%m). O 

Although G is discrete is the compact-open mapping topology, there are at least 
two other natural topologies on G making it a nondiscrete topological group. The 
first is the periodic point topology, defined as the weakest topology making the 
restriction homomorphisms G — aut(Qn, or) continuous for all n. In this topology 
an automorphism is close to the identity it if fixes all points with period less than 
some large bound. The second is the profinite topology on G, which has as basic 
open sets the cosets of finite-index normal subgroups of G. Clearly the profinite 
topology refines the periodic point topology, and the proof of Proposition 7.4 shows 
both are Hausdorff. Do they coincide? The following example shows the answer in 
general is “no”. 

EXAMPLE 7.10. A mizing shift of finite type such that the profinite topology 
on its automorphism group strictly refines the periodic point topology. 

Let 


0 1 1 
T= }]1 0 1], 
1 1 1 


indexed with states a, b, and c. We will use the dimension group representation of 
G to define a homomorphism @ from G to {+1}, and show that for every n there is 
a Yn E G fixing all points of period < n for which 6(~,) = —1. This will show that 
the profinite neighborhood ker 6 of J in G contains no periodic point neighborhood 
of I. 

Now T has characteristic polynomial xr (t) = (t+1)(t? —2t—1), so —1 is a simple 
eigenvalue of T, with corresponding eigenvector v = [1 —1 0}. Recall from §6 the 
dimension group representation 6:G — aut(G,, G27), Now Zv C Gr, and any 


element p € aut( r, GET) must map v to either v or —v. Let 0(p) be the choice 
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of sign used, so 0:G — {+1} with 6(~)(v) = 8(~)v. Since 6 is a homomorphism, 
so is ô. 

We now show that 0 has the required property. Fix n > 1. The blocks Mı = c"a 
and Mz = c”b have only trivial overlaps. Define p € aut(o7) to switch these blocks, 
and have no other effect. Clearly pn fixes all points with period < n. We compute 
the action of (pn) on v. For j = a,b,c choose the points z? so that 


oe ée ft 0, 
+ zj ifi=0. 
Consider the 0-rays C; = 2)(—00,0]* for 7 = a,b,c. Under the correspondence 
of equivalence classes of beams to classes of eventually nonnegative vectors in Gr 


described in §6, each class [Cj] corresponds to the class of the standard unit basis 
vector ej. Now Pn exchanges the 0-rays Ca with Cy, and fixes Ce. Thus 


0 1 0 

(Pn) =}1 0 0f, 
0 0 1 
and ó(pn)v = —v, so O(pn) = —-1. O 

8. p-adic aspects of the gyration representation. Let us begin by recalling 
some notation from the previous section. We let Qn = Qn(or) denote the points 
in Xr with least or-period n, and Qn = Qn(or) = Qn/= be the orbit space of 
Qn under or. For each orbit y E Qn pick z4 E y. If p E G, then pz, E€ v(4), so 
there is an integer n(y, p) defined modulo n so that yr, = o”) 2... The nth 
gyration number of p is 


g(y,or)(n)= Š nly, ~) €Z/nZ. 


VE On(oT) 


If Qn = Ø, this number is defined by convention to be 0. The gyration representa- 
tion of p is the homomorphism sending p € G to its gyration function 


(© ©) 
glp or) = (glp, or)(1), glp, or)(2),---) € ]] Z/nz. 
n=l 

If oj} is the full r-shift, Boyle and Krieger [BK, Corollary 2.3] showed that 
g(O[r},[r)) has infinite order in []7° , Z/nZ. It follows that Oir) is not a product 
of finite-order elements in G. Indeed, oj, is not even the limit in the periodic 
point topology of such products [BK, Theorem 2.8]. Also, g(o/,) x I, Ofr] X o[5}) has 
infinite order [BK, Proposition 2.4]. Using p-adic analysis, we shall extend these 
results to matrices T for which the product det% T of the nonzero eigenvalues is 
not +1. The crucial fact is that, roughly speaking, for most primes p the numbers 
g(or,or)(p”) converge p-adically to a transcendental limit in the p-adic completion 
Q, of Q. 

To see the computational significance of this convergence, consider Table 1 of the 
gyration function of the 2-shift acting as a self-automorphism, evaluated at powers 
of 3 and of 5. We have expanded the values to the appropriate base. 

The evident stabilization of the digit coefficients for, say, 5" can be expressed 
by saying that g(o(9), 0/2;)(5") converges 5-adically to some a € Qs. To prove this 
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E glota) O21) ( 9(O {2}, (2) 


2=2 1=1 
2=2+0-3 1=1+0-5 


11=2+0-3+1-3? 26=14+0-5+4+1-5? 
388 = 2+0:34+1-374+1-3° 276 =14+0-54+1-5742-5° 
388 =2+0-34+1-374+1-3° 2776 =14+0-54+1-574+2-5° 44-54 


TABLE 1. Values of the gyration function for the 2-shift. 


convergence, and identify the limit, note that since each orbit contributes 1 to the 
gyration function, we have 


|Psn (oia )| — |P5n-1(o/2])| 
hn 
5n 5n-1 5n_pn-l 
pore (if) (Fa). 
5n 5 5n — 5n-1 

As n — oo, we have 25” ' — w(2) € Qs, a 4th root of unity, the so-called 
Teichmüller representative of 2. Also, since 5” — 5"—! — 0 in Qs, the third factor 
converges to the 5-adic derivative of 27 at z = 0, namely the 5-adic logarithm log, 2 
(see [Ko, Chapter 5, §1] for details). Thus (0/2), 0[2})(5") converges 5-adically, with 
limit a = (1 — = )w(2) log; 2. If we denote ap +a15+425?+--- by .agajag..., then 
w(2) = .212134..., $ log, 2 = .330333..., and the product is a = .101240..., in 
agreement with the table entry. Furthermore, log; 2 is transcendental [Br]. Since 
£(2) is algebraic, the limit œ is also transcendental. Since a mod 5” is a unit 
in Z/5"Z, it follows that g(o(9),0;2})(5") generates Z/5"Z for n > 1. Hence oja) 
could not be a product of finite-order elements in G, for otherwise the orders of 
g(0(2}, 0[2})(n) would be uniformly bounded in n. This analysis fails at the prime 
2, since g(o(2), 0(2))(2”) — 0 in Q2. However, 2 is the only exceptional prime. 

It is perhaps interesting to note that the p-adic convergence of the gyration 
function was first discovered by computer experimentation. 

The ideas above extend to more general ør by using the p-adic eigenvalues of T. 


9(9 [2], 9[2])(5") = | sn (oj2))| = 


THEOREM 8.1. Suppose the product of the nonzero eigenvalues of T 1s not +1. 
Then the gyration representation g(or, 07) has infinite order in [ [F] Z/nZ. 


PROOF. Let the characteristic polynomial of T be 
xr(t) =t” (tt + at? 1 +--- +44), 


where ag # +1 by assumption. Choose an odd prime p relatively prime to ag. A 
good account of the basic p-adic analysis used here is contained in [Ko]. Let K be 
the splitting field of xr over the p-adic completion Qp of Q. The — valuation 
| - |p extends uniquely to K. Thus y7(t) factors over K as t™ Ili- i(t — A;). Since 
aq is a unit in the ring of integers in K, each |A;|p = 1. It follows there is an integer 
r > 0 so that |1—Ajlp < 1 for 1 < j < d. We will prove that g(orr,orr)(p”) 
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converges p-adically to a nonzero limit, show this forces g(orr,o7r+) to have infinite 
order, and conclude from this that g(or,o7r) = g(o7r) also has infinite order. 

For arbitrary U, we have |Qn(ov)| = Xam H(n/d)tr Ut, where p is the Möbius 
function, and g(oy,oy)(n) = |Qn(ov)|/n. Hence 


g(orr,orr)(p") = = ftr (T")P" —tr cr} 


re" —p"—*) l 


d 
1 n—1 
ae pi ATP Aea i 
( JE 3 p” — p”! 


Since |1 — Ažlp < 1, it follows that A7” " — lin K. The second factor in the 
sum converges to the p-adic derivative of A5* with respect to x at x = 0, which is 
rlog, àj. Here log, y is the p-adic logarithm defined for y € K with ly — 1ļp < 1 by 
the convergent series log, y = } o n—1(—1)”t! (y — 1)"/n. Thus as n — œ, 


1 
deren ors (3 ? 1) log, a #0, 


the nonvanishing following since ag # +1. This means that as n increases, the p- 
adic expansion of g(o7: , orr )(p”) has low order coefficients that stabilize to nonzero 
>p `, 


values. In particular, if 
1 
1 — — } log, aj 
( a g” 


then p"-*g(or+,o7+)(p") #0 mod p” for n sufficiently large. This clearly implies 
that g(orr,orr) has infinite order. We conclude by showing this forces g(or, or) 
to also have infinite order. First note that g(or,,orr) = g(o7,07) = rg(or, oT). 
The calculation of Proposition 1.6 of [BK] shows that if a; = a;(r,n) € Z is defined 
so ra; = j mod jn when j = (r,jn) and 0 otherwise, then 


k 


9(~,07)(n) = $ a;g(~,or)(jn) modn. 
jlr 

Apply this to p = ør in the above to express g(orr,o7r)(n) as an integral combi- 
nation of the g(or,or)(jn) for j |r. Thus if g(or,o7) had finite order, so would 
g(orr,orr). This contradiction establishes the theorem. O 

REMARKS. 1. If à is a unit in K, the splitting field of xr over Qp, then AP” 
converges to a (pê — 1)st root of unity w(A) in K, where e is the ramification index 
of K over Qp. The above shows that we always have in Q, that 


d 

! 1 rA 

(8-1) Jm a(er,or)(o") = (1- =) log, | ] | Oe 
J= 


However, we cannot conclude from this that the limit is nonzero. By taking rth 
powers, we force w(A5) = 1, so the product in (8-1) is a} £ 1, and the limit does 
not vanish. The general question of whether (8-1) is nonzero hinges on how the 
A; and the corresponding root of unity w(à;) interact. For example, if the à; 
are multiplicatively independent, then the p-adic versions of Baker’s transcendence 
results [Br] show that (8-1) is not zero. 
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2. If the product of the nonzero eigenvalues of T is +1, all that is needed to 
prove the theorem is to find one prime p for which (8-1) does not vanish. Since this 
is the “typical” case, finding such a p can usually be done by hand. For example, 
i = F a] we can take p = 3 and simply check that the product in (8-1) is not 
+1. Indeed, all but finitely many primes will work in this case. 

3. If det* T # +1, the roots of unity w(A;) in (8-1) are merely a nuisance, to 
be eliminated by passing to rth powers. However, when det% T = +1 it is essential 
that at least some of the w(A;) not be 1 to obtain a nonzero limit in (8-1). 

The proof of Theorem 8.1 can be modified to obtain the following. 


THEOREM 8.2. Suppose the product of the nonzero eigenvalues of T 1s not +1. 
Then g(or x I,or x oy) has infinite order. 


PROOF. Choose a prime p relatively prime to det% T and to det™ U. We first 
compute the gyration function g(or x I,or x oy)(p"). Now 


n 


Qpr (or X ou) = U {Qpr (or) X Qp (ou) U Qp (Or) X Qpr (ov) }- 


k=0 


If (x,y) E Qp (or) X Qpr (ou), then the smallest q > 0 for which (or x I)? (z, y) is 
in the ør x oy-orbit of (x,y) is q = p*, and this element is (x,y) itself. It follows 
that such orbits contribute 0 mod p” to the gyration function. Let 0<k<n-l1. 
For each orbit y E Qpr (or) pick z, € y. Then 


U {z7} x Qpr (ou) 
EQpn (or) 
is a complete set of orbit representatives for Qpr (or) X Qpr (oy). Since 
(or x I)(z, y) = (or x ou) (2,079) 


each representative contributes 1 to the gyration function. Hence 


n—-1 


glor x lor x ov)(p") = |Qpn(or)| >, |Qp«(or)| 
k=0 


a (tT —trT?""’) (tr UP"). 


Suppose K is a splitting field for xr (t) and xu (t) over Qp. Let T have nonzero 
p-adic eigenvalues A;,..., Aq, and let those of U be p1,...,He. Then by the above 


Pp" _ yer nA 
glor x lor x ou)(p")= X —+__t__ u ‘ 


n 
1,7 p 


The p-adic convergence of this expression follows as before. To obtain a nonzero 
limit, pass to rth powers to obtain A;” isi wr " > 1, and argue as in the proof 
of Theorem 8.1. O 

Theorem 2.9 of [BK] implies that if det* T # +1, then ør is not the product 
of finite-order elements in G. This is also a consequence of Theorem 8.1. The 
following, which generalizes Theorem 2.8 of [BK], shows ør is not even a limit of 
such products. 
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PROPOSITION 8.3. Suppose the product of the nonzero eigenvalues of or is not 
+1. Then or is not the limit, in the periodic point topology on G, of products of 
finite-order elements. 


PROOF. The periodic point topology was defined in §7 to be the coarsest making 
the restriction maps G — aut(Qn, or) continuous. From the proof of Theorem 8.1 
we can conclude there is a prime p and an integer m so that for all n > m we have 


g(or,or)(p") #0 mod p”. 


The result now follows exactly as in the proof of Theorem 2.8 of [BK] by using 
Proposition 2.7 there. O 

REMARK 4. All of the above results will hold even if det T = +1, provided 
the prime p used in the proofs can be chosen so that (8-1) is nonzero. As indicated 
in Remark 2, in specific case this can usually be done ad hoc. 


PROBLEM 8.4. Remove the restriction det” T Æ +1 from the above. 


The proof of Theorem 8.1 shows that g(or,o7r)(p”) converges p-adically for all 
primes p. It is also easy to see that if p has finite order, then g(y,or)(p") — 0 
in Qp. Thus for automorphisms p constructed by known methods, g(y, or)(p”) 
always converges p-adically. 


QUESTION 8.5. Does the gyration function g(y~, or) (p") converge p-adically for 
every p E aut(or) ? 


9. Compact invariant sets. In §7 we investigated G = aut(or) by studying 
its action on the compact G-invariant sets Q,(o7r) of points with least o7-period 
n. Are there other compact G-invariant subsets of X7? In this section we show 
that, modulo possibly a few orbits of low period, every compact G-invariant subset 
is a finite union of various Qn(or) or all of Xr. We also discuss the implications 
for the action of G on the compact space of subshifts equipped with the Hausdorff 
metric. 

Several constructions and proofs to follow are made clearer by the process of 
“passing to a higher order block presentation”, described as follows. Recall from §1 
that if T is r xr, then the graph of T has r states or nodes, and ye j=1 Tiz symbols 
or edges. Order the set £L of symbols arbitrarily. Let F(T) be the transition matrix 
for symbols, defined by E(T)zy = 1 if the terminal state of x matches the initial 
state of y, and 0 otherwise. Iterating this procedure yields the transition matrix 
E” (T) = E(E"~1(T)) for allowed n-blocks of symbols. Specifically, we define the 
n-block presentation of T to be T!"] = E*-1(T), where E°(T) = T. The symbols 
for T!"] are then the allowed n-blocks of ør, i.e. Lrin) = Bn(or). If B,C € Lrin, 
then Tir = 1 if B[l,n — 1] = C[0,n — 2], and 0 otherwise. Thus the symbols in a 
point x E€ Xr have simply been recoded in Xrin) by using n-blocks of symbols. 

There is a natural conjugacy Gy: Xr — Xyin) defined by 


(6,2); =zli-—k,t—k+n-— 1], where k = =| 


The metric on Xrin is understood to be that induced from Xr under n. With this 
convention, the maximum of the diameters of the sets B* = {y E€ Xrini: Yo = B} 
over BE Lrin tends to 0 as n > oo. 
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If K is a subshift of Xr, let B(K) be the collection of all blocks that occur in 
some point in K. For z € Xr, put B(x) = B(Sz). If B € Ba(Xr), put Bt = 
{xr E€ X7:2|0,n — 1] = B}. Here the reader should distinguish B considered as an 
n-block from Xr from B considered as a symbol from Xyp{n}. The homeomorphism 


Ên introduces a factor of oft /2] between the two meanings of B*. In particular, 


max{diam(B"): B € Lyin} ~0 asn—o. 


Thus if Y;,...,¥, are disjoint subshifts of Xr, for all large enough n the sets 
Lrini(Y;) of symbols from Lyin) occurring in Y; are disjoint. Furthermore, if Z C 
Xr is a shift of finite type, for all large enough n we can represent Z as 1-step on 
the symbols Lrin (Z). 

For completeness, we prove a standard technical result. A stronger version is 
contained in [DGS, 26.17]. 


LEMMA 9.1. Suppose B € B(Xr) and that B ¢ B(x). Then there is an infinite 
mizing shift of finite type Z C Xr with B€ B(Z) and Z disjoint from Uz. 


PROOF. Since our standing assumption is that ør is mixing, there are C} € 
B(Xr), J = 0,1, with relatively prime lengths, with BC; B € B(Xr), and such that 
B occurs only as the initial and terminal block in each BC;B. Form the shift of 
finite type Z C Xr of points of the form --- BC;_, BC;,, BC;, B--- with îi =0or 1 
for k € Z. Since the C}; have relatively prime lengths, or|z is mixing with positive 
entropy. Let n > 2(|Co| + |Ci| + |B|). Every block in B,(Z) contains B as a 
subblock, while B does not occur in z. Thus, by passing to a higher order block 
presentation, for large enough n, we can realize Z as a 1-step shift of finite type on 
an alphabet disjoint from the symbols occurring in z. Thus ZN “z=. O 


THEOREM 9.2. The automorphism group orbit Gx of x E Xr 1s dense if and 
only if x is not or -pertodic. 


PROOF. Since the period of a point is preserved under an automorphism, a 
or-periodic point z clearly has Gz finite. 

Now assume z is not ør-periodic. To show Gz = Xr, it is enough to prove that 
for every B € B(Xr) there isa p € G with B € B(ypz). If Be B(z), take p = I. 
Thus we assume B ¢ B(z), and find p. By Lemma 9.1, there is a mixing shift of 
finite type Z disjoint from Uz with B € B(Z). 

We first treat the case that a block in B(z) occurs exactly once in z. By passing 
to the n-block presentation of Xr, we can assume Z is a 1-step shift of finite type 
on an alphabet L(Z) C Lyin) = By(or) disjoint from the symbols occurring in 
z, that B occurs as a subword of b € £(Z), and that a symbol a € Lyin) occurs 
in z at a unique index, which by applying a power of orin) we can assume is 0. 
For the rest of this case, all symbols and blocks are from the n-block presentation. 
Let l be a transition length for Xrin) and for Z. Since both are mixing, there is 
a block E = aAbCbA'z[4l,6l] such that |A| < l, |A’| < l, a does not occur in A 
and occurs in A’ at most once, C € B(Z), and |E| = 61+ 1. We claim that E and 
F = z(0,6l] overlap only trivially in the sense of §2. Since a occurs only as the 
initial symbol in F while E contains symbols from Z never used in F, it follows E 
does not overlap F, and that F only overlaps itself trivially. The only nontrivial 
overlap of E with itself could occur with the occurrence of a in A’, but this is ruled 
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out since this would force z[4l, 6l] and C to have symbols in common. Thus, as in 
§2, the map p €E G defined by interchanging E and F and having no other effect is 
a well-defined involution. Since the symbol b from the n-block presentation occurs 
in yz, it follows that the original block B occurs in yz, completing this case. 

We next turn to the recurrent case. Here blocks in z can recur with distress- 
ing frequency, making the nonoverlapping condition for markers more difficult to 
achieve. The reader should keep in mind the Thue-Morse sequence, where every 
allowed block of length n recurs within 8n, to better understand the complications 
below. 

Let z, B, and Z retain their meanings, and assume we have already passed to a 
higher order block presentation with B a subword of b € £(Z) and L(Z)NL(z) = Ø. 
Choose symbols a,c E€ L(x) = B,(z) so that there are words aAb, bA'c € B(X7r), 
where A and A’ use no symbols from L(x), and so that some word aFc occurs in 
x. We claim we can assume there are arbitrarily long words of the form aFc in 
B(x). For if not, the longest such word could occur only once, and the previous 
case applies. Let l be a transition length for Xr and for Z. Adjusting z by a power 
of or if necessary, there is a k > 4l with zo = a and 2, =c, and a word C € B(Z) 
so that D = aAbCbA'c € B(X) with |D| = k+1. Note that the only symbols in D 
that are also in L(x) are the initial and terminal symbols. Let E = 2[0,k]. Since z 
is not o7-periodic, there is a p > k so that z[—p, p— Jj] # z|—p+ J, p] forO <7 < k. 
Define €: Bop41(X) — Bop41(X) by replacing each occurrence of D by E; if a block 
begins with a terminal segment of D, replace it with the corresponding terminal 
segment of E, and similarly at the other end. Since D cannot overlap itself except 
possibly in one symbol, € is well-defined. Note that €(z[—p, p]) = z[—p, p). 

Let M = €—1(z[—p, p]) MB, (X){D, E}Bp-x(X). Then M has the nonoverlapping 
property that if M, M' € M, then M[—p,p— J] # M'|—-p+ J, p] for 0 < j < k, since 
otherwise applying € would contradict the choice of p. Define p € G as follows. If 
y[—p, p] E M, then put 


E if y{0,k] =D, 


(ey) l0, k] = | D if y0, k] = E. 


Declare p to have no other action. Because of the nonoverlapping property of the 
words in M mentioned above, a symbol in y can be affected at most once by p, 
so p is well-defined. If y[—p, p] E M, then (yy)[—p, p] E M, proving that p is an 
involution. Finally, since the symbol b from the higher order presentation occurs 
in D, which is a subblock of (y2z)[—p, p], we see that the original block B € B(z), 
completing the proof. O 


THEOREM 9.3. Let or be a mizing shift of finite type, and define no(T) as in 
Theorem 7.2. If K is a compact G-invariant set, then either K = Xr, or K is a 
finite union of Qn,(or) with n; > no(T) together with possibly a finite number of 
orbits with periods < no(T). 


PROOF. If K contains a nonperiodic point, then K = Xr by Theorem 9.2. If 
t E KNQ, (or) for some n > no(T), then by Proposition 7.3 we have Qn(or) C K. 
If n; — oo, then Usa, Qn, (0r) is dense, so either K contains only a finite number 
of the sets Qn(or) for n > no(T), or K= Xr. O 
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REMARKS 1. Until the question of switching points with small periods, such as 
in Example 7.3, is settled, the possibility remains that for some n < no(T) the set 
Qn(or) could have proper G-invariant subsets. 

2. An irreducible shift of finite type is a finite tower over a mixing base shift of 
finite type, so Theorem 9.3 easily extends to this case. 

Let S$ denote the compact space of subshifts of (Xr,o7) equipped with the 
Hausdorff metric. In this context, two subshifts Y, Z € S are close if for a large n 
the sets B,(Y) and B,(Z) coincide. This is the topological analogue of the weak 
topology on o7-invariant measures used in the next section. From this description 
it is evident that G acts continuously on S. Theorem 9.3 identifies all but a finite 
number of the fixed points of this action. The result also shows that if Y € S$ 
is infinite, then Xr is in the G-orbit closure of Y in S$. We now generalize this 
argument. 


PROPOSITION 9.4. IfY C ZC Xr are infinite subshifts with Z mixing, then 
Z is in the G-orbit closure of Y in the Hausdorff metric. 


PROOF. Fix an n > 1, and enumerate the blocks of B,(Z) as By,..., By. Since 
or|z is mixing, there is B € B(Z) which contains each B}. Pick y € Y with infinite 
orbit. Let Z' C Xr be the shift of finite type so that B,(Z') = B,(Z). Then 
Z' is mixing since Z is. The construction of p with B € B(py) in the proof of 
Theorem 9.2 can be carried out with Z’ as the ambient space, so pY C Z’. Since 
each B; occurs in B, we can conclude that B,(~Y) = Bn(Z’) = B,(Z). Thus every 
neighborhood of Z in $ contains an image pY for some p E€ G, completing the 
proof. O 

It is more difficult to determine when a subshift Z in general position is in the 
G-orbit closure of Y. Two necessary conditions follow. 


PROPOSITION 9.5. If Y and Z are subshifts of Xr and Z is in the G-orbit 
closure of Y, then 


(1) h(or|z) = Alorly), 
(2) |Qn(orl|z)| = lQn(orly)| for n> 1. 


PROOF. Suppose y, € G with pY — Z. Since topological entropy is upper 
semicontinuous on S$, and h(or|y,y) = h(orly ), property (1) holds. Also, for fixed 
n we must have for large enough k that ox[Qn(orly)] C Qn(orl|z), from which (2) 
follows. O 


PROBLEM 9.6. Determine necessary and sufficient conditions on Y, Z € S for 
Z to be in the G-orbit closure of Y. 


Of course, Example 7.3 shows that this problem is not solved even for subshifts 
with a finite number of points. 


10. Orbits of measures. Let P(X7) denote the compact convex set of or- 
invariant probability measures on Xr, and M (Xr) be the o7-invariant nonnegative 
measures, each equipped with the weak topology. The group G = aut(or) acts nat- 
urally on both spaces, and the unique measure ur of maximal entropy is invariant 
under every p € G [CP]. Are there other continuous G-invariant measures? We 
show there are none, and obtain a complete characterization of the G-orbit closure 
in P(X) of a continuous probability measure. 
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To state our result, let h(u) denote the entropy of or with respect to u E M(Xr). 
If Æ% is the partition of Xr by the Oth coordinate, then 


4 
h(u)= lim = = = (A) log (A). 
AEV’ Zo or’ Po 


Observe that this formula applies even to nonnegative measures with total mass 
different from 1, and that if r,s > 0 and u,v € M(Xr), then [W, Theorem 8.1] 
h(ru + sv) = rh(u) + sh(v). We shall use a version of the ergodic decomposition 
suited to our needs. For u € P(Xr) there is a measurable function t + ut from 
(0, 1] to P(X) so that each ut is ergodic under ør, if E C Xr is Borel then (E) is 
measurable in t with u(E) = is ut(E) dt, and h(") is decreasing in t. Note that the 
ut need not be distinct. Indeed, if u is already ergodic, then pt = p for a.e. t. All 
but the entropy statement follows from the standard ergodic decomposition [DGS, 
Chapter 13]; h(u*) can be arranged to be decreasing by a measurable rearrangement 
of [0,1]. A limiting form of the linearity of h mentioned above shows that if E C 
(0, 1], then h( fp ut dt) = fgh 

Define the cumulative entropy H,(t) = Ss h(u°)ds for 0 < t < 1. This function 
can be defined without reference to the ergodic decomposition by 


H,„(t) =sup{h(v): v E€ M(Xr), v < u, v(Xr) =t}. 


For ergodicity of the % if v < r en v(Xr) = t, then v = fo pls u°? ds with 
Jo 0(s)ds = t. Thus h(v = fo ol °) ds which, by monotonicity : hi t), has 
ed ae value H,,(t) a pis i ee function of [0, t]. Our characterization 
of orbit measures is the following. 


THEOREM 10.1. Letor be a mizing shift of finite type on Xr, let G = aut(or), 
and suppose p E P(Xr) is continuous. Then v is in the G-orbit closure of u iff 
H, < H, on (0,1). 


COROLLARY 10.2. The measure ur of mazimal entropy for or is the only 
G-invariant continuous probability measure on Xr. 


PROOF OF THE COROLLARY. Suppose u € P(Xr) is continuous and G- 
invariant. Then h(u*) < h(u$%) = h(or) for all t, so H, < Hy,. Thus ur € 
Gu={u}. O 

Note that if u and v are ergodic, the condition of Theorem 10.1 reduces to 
h(u) < A(v). 

We will first prove the necessity of the condition H, < H, by using upper 
semicontinuity of entropy. The main work is to prove its sufficiency. We first find 
in Lemma 10.3 a partition 0 = to < tı <: < tg =1 so that if Ją = [te i, tk), 
a, = |J,|, and uk = Ti Si, ut dt, then u is a convex combination aa 1 ÆkHk, 
where the ux are mutually singular and h(u$.) is almost constant. We then use the 
Convex Approximation Lemma 10.4 to perturb v to v = pee ækĎg, SO that the 
Ük are mixing Markov measures with disjoint supports and h(k) > h(u,). Using 
a Rohlin stack argument in Lemma 10.5 based on a marker, and assuming that 
almost every yu’ has full support, we construct a marker automorphism in Lemma 
10.6 mapping u close to v that does not affect a preassigned subshift. The case when 
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u does not have full support, involving difficulties similar to those in the second 
half of the proof of Theorem 9.2, is treated in Lemma 10.7. Using a decomposition 
from Lemma 10.8, the pieces of the proof for sufficiency are, at last, assembled. 
PROOF OF THEOREM 10.1 (NECESSITY). Suppose u € P(Xr) is continuous 
with ergodic decomposition p = J u° ds, and gy, E G has pnu —> v weakly. Fix 


t € [0,1] and put p’ = Jy u° ds. By compactness, a subsequence of pnu’ converges 
to some v’, where v'(Xr) = t and v’ < v. By the intrinsic definition of H, given 
above, we get h(v’) < H,(t). Using the upper semicontinuity of entropy [W, 
Theorem 8.2] and that h(y’) = h(pnp') for all n, we have 


H(t) = h(p') = lim sup h( pny’) < h(v') < H,(t) Oo 


n— 0O 


We begin building the machinery to prove sufficiency. For uw € M(Xr) let 
supp(jz) denote the complement of the largest open p-null set. 


LEMMA 10.3. Let u,v E€ P(Xr), let Z C Xr be a proper subshift, and suppose 
h(v) > h(y). For every neighborhood V of v in P(Xr) and £ > 0, there ts a mizing 
Markov measure Ù E V with |h(v) — h(v)| < €, supp(v) NZ = Ø, andv L u. 


PROOF. Let V be a neighborhood of v in P(Xr) and € > 0. Replacing T 
by a higher order block presentation if necessary, there is a 6 with 0 < 6 < € so 
that if |v’(a*) — v(a*)| < ô for all a E€ Lr, then v’ € V. Since Z is proper, there 
is a D € Ba(Xr) so that D* N Z = ©, where we continue to use the notation 
D* = {x € Xr: z[0,d — 1] = D}. Let l be a transition length for T, and let y > 0 
whose value will be determined later. By the ergodic and Shannon-McMillan- 
Breiman theorems, there is an m large enough so m/(21/+d+m-+1) > 1-6/3 and 
collections C; C B;(Xr) (gj =m,m-+1) so that 


(10-1) exp[m(h(v) — 7)] < |C;| < exp[mh(v)] 
and 
(10-2) l|fa(C) — v(a*)| < : for all C E€ Cm UCm41, 


where f,(C’) denote the frequency of the symbol a in C. For each C € C = 
Cm U Cm+1 choose Ao, A; E€ Bı(Xr) with C’ = ApC'A; so that DC'D e B(X7r). 
Form the collection C’ of such blocks. Let Y be the shift of finite type with the 
subblocks of concatenations of elements of DC’ as allowed blocks. Clearly Y is 
topologically mixing since m and m+ 1 are relatively prime. Let Y be the measure 
of maximal entropy on Y. We show first that |h(v)—h(v)| < € and supp(V)NZ = ©, 
then perturb T to get singularity with respect to yp. 

First note that Y C Ua” o4 (D*) and Zna% (D*) = Ø for all j, so YNZ = Ø. 

Let v’ be any o7-invariant measure with v’(Y) = 1. By the ergodic theorem, 
there is a y € Y so that y[0,d — 1] = D and |fa(y[0,n — 1]) — v'(a*)| < 6/3 
for all a E€ Lr and sufficiently large n. Pick n so that y[0,n — 1] has the form 
DC, DC y---DC,, C; E C'. Then k > n/(2l +d + m), and if C; has central block 
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C; then 
k 
nfa(y[0,n — 1]) — nv(a*)| < (2d + 1)k +$ |C; (fa(C;) — v(a*)) 
j=1 
2l +d 6 2 
da (oe eee —n < —6n. 
= (asin) "+ go get 
In particular, |D(a*) — v(a*)| < 6 fora E Lr, proving T E V. 
Next we estimate h(V) = h(orly) by estimating |B,(Y)|. Every E € B,(Y) has 
the form E = ADC, DC, --- DCB, where |A|, |B| < 21+d+m, C} E€ C’, and 
een | ay a 
2i+d+m+1 ~ ~2+d+m 
Making independent choices of the C}, it follows from (10-1) that 


[Bn(Y)| > (exp[m(h(v) — 7)" 
= Ri exp (ar) mn) - a) 


> K] exp |(1- E) mh w) =a), 
where xı > 0. Hence 


oes ol 6 
horly) = lim = log|Bn(¥)| 2 (1-3) (he) -= 9) > Al) = 
for y small enough. Similar estimates show 


IBn(Y)| < |Bor+a+m(Xr)|? (exp[mh(v)]) ° 


< k2 exp (a) nh(v) 


for suitable x2 > 0, showing h(orly ) < h(v). Thus h(v) —e < h(T) < h(v). 

We have hence found a mixing Markov measure T €E V so that |h(V) — h(v)| < € 
and supp(V)NZ = ©. By perturbing the transition probabilities of 7, we can obtain 
a continuous family {Ta } of mixing Markov measures each supported on Y and with 
Vo = V. For small enough a the Da remain in V and have |h(Ta) — h(v)| < £. Now 
the Ta are ergodic and distinct, so they are mutually singular. It follows that all 
but countably many of the Da are singular with respect to u. Set v to be any of 
these. O 

If u has ergodic decomposition u = J u* dt, define hoo (u) = ess supo<t<i A(p’). 

CONVEX APPROXIMATION LEMMA 10.4. Let u,v € P(Xr) with H, < H, 
and hoo(p) < h(or), and let Z be a proper subshift. For every neighborhood V of v 
in P(Xr) there is a Ò =) >), ake E V and a corresponding convex decomposition 
u = Dan Ape, where a, > 0 with Siar a, = 1, such that the uk and Ùk are 
all mutually disjoint probability measures, the Vy, are mizing Markov measures with 
disjoint supports that are also disjoint from Z, and hoo(uK) < h(i). 


PROOF. Let u = h ut dt and v = T vt dt be the monotone ergodic decompo- 
sitions as described above. Put @ = h(or) — hoo(u) > 0. Choose € > 0 so that 
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if v' € P(Xr) and ||v — v'|| < 2e, then v’ E€ V. By partitioning [0,h(or)] into 
intervals of length < ¢f/2, we can find 0 = to < ty <--- < tk-1 < tx = 1 so that 
if Je = [te-1,th) and we =|Jx|~* J, ut dt, then the ug are mutually singular and 


(10-3) sup {h(u")} — inf {h(u')} < 68/2. 
te J te 


Put a, = |J;|. We now inductively define vg € P(Xr) so that v = ae pV, and 
h(vx) > h(uk). Let 


8+t tı 
Sı = sup fs < l-t: / nr’) at > | h(ut) at}. 
s 0 


Since H, (t1) > H,(t1), the set defining sı is nonvoid. Put vı = ty’ ae vt dt. By 
definition h(v1) > h(i), and we claim Hy_,, > Hy—», on [0,1 — tı], which allows 
inductive construction of all the vk. To prove the claim, first suppose sı = 1 — tı. 
Then using monotonicity 


iPr 
a) a t 
ae inf th(v \} > a MN h(v") dt 
Le t 
>] h(w')dt> sup fth(u')}, 
1 Jo ti<t<1 


so Hy», > Hy-—p,, follows trivially. Now assume sı < 1 — tı, so by continuity 
SET n(vt) dt = fo" h(ut) dt. If u < sı, then 


| Ripe J ETE [ ” h(ut) dt 


tit+tu 
> J h(ut) dt = Hp-p (0), 


tı 
while if sı < u < 1 — t, then 


tı+u sı+tı 
He (ae J ni) ae J h(v*) dt 


31 


tı +u ti 
> J h(u*) dt — J h(u*) dt = Hp-p, (u), 
0 0 


verifying our claim. 
For each k let vk = Io vt dt be the ergodic decomposition, put vy, = fy “vi dt+ 


Er, and sum v’ = eae v. We show h(v,) > hoo(ux). For by (10-3) 


J- h(v;,) dt + eh(or) 


=E 


iS 


( 
> h(uk) — Ehoo(ue) + Eh(or) 
> Noo(Mk) + €B/2 > Roo (Hk). 

Since ||v — v’|| < 2e, it follows by our choice of € that v’ € V. Finally, by Lemma 
10.3 we can inductively modify each vj, to Ďķ so that v = pea 1 kx satisfies the 
conclusions. O 

If F C Xr and z € F, define rp (x) = min{n > 0: of2 € F}, and put Fn = {re 
F :rp(z) =n}. For a collection C C B(Xr) let C* = Ucec C”. 
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LEMMA 10.5. Suppose uk E P(Xr) for1 < k < K are mutually singular, 
ande > 0. There is an M = M(e, mı,..., ug) so that if F is compact open and 
N > n > M, then there are collections Cn(uk) C Bn(Xr), disjoint inn and k, such 
that |Cn(ur)| < expln(hoæl(uk)+€)| for1<k< K and 


N 


SO Me (Fn N Caluk) =D male 


n=M 


PROOF. The rough idea is to use singularity of the ug to first find disjoint 
collections Dk of atoms having the right exponential size and containing most of 
the uķk-mass. Using the idea that a partial orbit can intersect at most one of these 
atoms with frequency > t, we obtain from the D, the required disjoint collections 
Cn (Uk). 

Since the yw, are mutually singular, there is an s and disjoint collections Dk C 
Bos+i1(Xr) such that u,(Df) > 1 — €/12 for 1 < k < K. Applying the version of 
the Shannon-McMillan-Breiman theorem from [DGS, Theorem 13.4] to or and to 
On for all large enough M and 1 < k < K we have 


E€ E 
uk(E} ) = uk |a sup —— log ux (20 m — 1]*) < hoo(u Dfl Pi 
g m>M/6 
= E E€ 
Me (Ey) = Hk fa sup ~~ log pe (2[-m + 1,0]*) < hoo (uk) + | et 
m>M/6 


Let Ey, = E} NE; and fix M so that up(Ek) > 1—e/6, and also satisfying M > 6s 
and 


(10-4) Z exp oo (He) + (n+ 1)=| < expļ|ne] 


for n> M/6and1<k<K. 
Suppose now F is compact open, and M < n < N. If z € F, with o4z € Ex for 
some n/3 < j < 2n/3, then 


pr (2[j,n — 1]*) > exp [-(n = 3) (holur) + 5) 


The number of blocks z[j,n — 1] arising from such an z is thus bounded above by 
exp [(n — 7) (hoo (uk) + €/2)]. Similarly the number of blocks 2z(0, 7] from such an z 
is < exp [(j + 1)(hoo (ue) + €/2)]. It follows using (10-4) that the number of blocks 
z[0, n — 1], where z € Fn and o7,z € Ep for some n/2 < j < 2n/3 is bounded above 
by 


= exp (n +1) (hoo (ie) + 3] < exp [n(hoo (ue) + €)]. 


Now let Cn (up) denote the collection of blocks z[0, n—1], where z € Fn, n, E€ Ek 
for some n/3 < j < 2n/3, and 


2 


e E alj- sj +s] € Da} P 


6 


Since the Dk are disjoint, clearly the Cn (uk) are disjoint in k and trivially in n since 
the lengths differ. The discussion above proves |Cn(uk)| < exp[n(hoæ(uk) + £). 
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Finally, since 


nup{z E Fn: t[0,n — 1] £ Cn(ux)} 


2n/3 2n/3 
< 3uk | EEN LU of Fn | +6ue | Den LJ of Fn |, 
j=n/3 j=n/3 
we have 
N N 
XO nuk(FnOCa(ue)") > XO nur(Fa)— 3uk(E£) — 6u (Df?) 
n=M n=M 
N 
> > nuk(Fn) =g O 
n=M 


LEMMA 10.6. Suppose p,v E€ P(Xr) with H, < H,, that u is continuous, 
and that supp(y") = Xr for a.e. t. Let Z be a proper subshift of Xr and V be a 
neighborhood of v in P(Xr). Then there is an involution p € aut(or) such that 
puEV and ¢ is the identity on Z. 


PROOF. We first reduce to the case holu) < h(or). Let to = sup{t: h(ut) = 
h(or)}. Since ør is intrinsically ergodic, for 0 < t < tg we have ut = vt = pr. 
Since ur is G-invariant, any p € G, in particular the p we construct below, maps 
fo ° ut dt = tour to i ° yt dt. Removing this part from each, we are reduced to the 
case h(u*) < h(or) for all t. Since it is enough to map (1 —«)7! fc ut dt to V for 
a sufficiently small € > 0, we can and do assume ha (pu) < h(or). 

Using the Convex Approximation Lemma, there is ù = ioe anv, E€ V and 
u= pas Ap, With a, > 0, pie a, = 1, so that the uw, and yy are all mutually 
disjoint, the vy, are mixing Markov measures with supports Yx that are mutually 
disjoint and disjoint from Z, and hao(ux) < h(x). 

By passing to a higher block presentation, we can assume that the orly, and 
or are l-step, Ly, and Lz are all disjoint, and that there is a 69 > 0 so that if 
|v’(a*) — v(a*)| < ĉo, then v’ E V. Let 6 > 0, which will eventually be made small 
enough for everything which follows to work. The first requirement on 6 is that 


(10-5) (1 — 6)[A(Ye) — 6] > hluk) + ô 


for all k. 
By the ergodic and Shannon-McMillan-Breiman theorems, there exists an Mo 
so that for n > Mo there is a collection Cn (Dk) C Bn(Y,) so that 


|Cn(H)| > exp[n(h(Yx) — 4)], 
Dk (Cn(Ùk)*) > 1-4, 
and 
(10-6) lfa(C) —%(a*)| <6 foralla€e Lr. 


There is a block Ao € B(Xr) so that A§ N (Z UU% Ye) = Ø. Since the number 


of blocks in B,(X7) beginning with Ap grows exponentially in n, and each py has 
full support, there is a block A beginning with Ag so that 0 < p,(A*) < 67/4|A| 
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for 1 < k < K. Let |A| = d and put M = [d/46]. Let l be a transition length for 
each or|ly, and for ør. We may assume d is large enough so that M > Mo, that 
M > M(6, 41,..-,; uK) from Lemma 10.5, and that n—2d—4l > (1—6)n for n > M. 
Since supp(uk) = Xr, there is an N > M so that uk (Cae l o LA") > 1-6 for 


1<k<K. Let F=A*. If Fa = {z E€ F: rp(z) =n}, then 
È malh ) > 1— My;(A*) —6 > 1-26. 


By Lemma 10.5, Poin are mutually disjoint collections Cn(uk) C Bn(Xr) so that 
|Cn(uk)| < exp[n(hoo (He) + 6)] and 


N 
(10-7) Do MH (Fn Cn (u)”) > 1- 36. 
n=M 

We now construct p. For each k choose a block Ax so AAx E B(Xr) has minimal 
length such that it ends with a block from Bg(Y;,). Note that |A,| < d is possible. 
Clearly |A,| < 2d + l since L is a transition length for or. Similarly choose By so 
B,A € B(Xr) and has minimal length for such a block starting with a block from 
Ba(Yk). For each C E Cn-3a-41(ŬÙx) there is a C” containing C as a subblock so 
that AAC" BKA E€ Bn+a(Xr), and so that A occurs only as the initial and terminal 
d-block. Form Dy, = {AAkC" BA: C E Cn—3a—41(Y~)}, sO 


lDnk| = |Cn—3a—41(U%)| > exp[(n — 3d — 41)(h(D,) — 6)] 
> exp[n(1 — 6)(h(ix) — 6)] 
> exp[n(hoo (Hk) + 6)] > |Cn(ux). 
Thus it is possible to define a permutation Ôp of ABn-a(Xr)A so that 6? =f, 
Onk(Cn(uk)A) C Dnr, and Onx is the identity off Cn (uk) AU bnk(Cn(uk)A) Note that 
each block in Cn(ug) begins with A, so these 0,, are exactly the type introduced 
in §2, where here A is the marker block. 

If z E€ Xr with z[i,i + n + d] E Cn(uk)A U bnk(Cn(uk)A), where M <n< N 
and 1 < k < K, define (yx)((i,2 +n + d]) = 6n4(2[2,2 +n + d]). Since the Cn(up) 
are disjoint in n and k, and 62, = I, it follows that p is a well-defined involution. 

We complete this proof by showing that |pu;(a*) — %(a*)| < ôo for each k and 
a E€ Lr. This will imply that |gy(a*) — v(a*)| < 69 for a E Lr, so pp E V, the 
required conclusion. If C € Cn(uk) then 


anor N U oh (F, n N Onk (C »] =nfa (Onk(C)) Our (Fn N Onk(C)*) 


Sifa (Onk(C)) Ue (Fn Nc). 


Letting 
N n-1 


= U U LU ot. (Fn N Onk(C)*), 
n=M 7=0 CECn(ur) 


then (10-7) above shows that yu,(E) > 1 — 36 while by (10-6) |py,(a* N E) — 
V,(a*)| < 26. For 6 sufficiently small this forces |pyu;,(a*)—,(a*)| < ôo , concluding 
the proof. O 
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The following result deals with the case that u does not have full support. Com- 
plications arise because may be highly recurrent, so that constructing markers 
with the necessary disjointness is more difficult. To appreciate the problems, an 
excellent example to keep in mind while reading the proof is for u to be the unique 
invariant measure supported on the Morse minimal set and v to be the measure of 
maximal entropy on Xj). In this sense, the difficulties parallel those in the second 
half of the proof of Theorem 9.2. 


LEMMA 10.7. Suppose u,v E€ P(Xr) with H, < Hh, that u is continuous, and 
supp(u) Æ Xr. For every neighborhood V of v in P(Xr) there is an involution 
p E aut(or) with pueV. 


PROOF. Let Z = supp(u) # Xr. Since supp(u’) C Z for a.e. t, it follows 
that ho(u) < h(or|z) < h(or). By the Convex Approximation Lemma, there is a 
p= D ap, E V and w= pia OU, Where ay > 0 with D a, = 1, the uk 
and vy, are all mutually singular, each 4 is mixing Markov supported on a mixing 
shift of finite type Y, with the Yp and Z mutually disjoint, and ho(uR) < h(Ùk) 
for all k. 

By passing to a higher order block presentation, we may arrange Ly, and Lz 
to be disjoint, or|Y;, and or to be 1-step, and for there to exist a 69 > 0 so that if 
|v'(a*) — v(a*)| < ĉo, then v’ E V. Introduce 6 > 0, which will eventually be made 
small enough for the following to work. 

Before starting the main argument, we first recode ør so that for each a E€ Lz 
and all k there are “escape” blocks Ag, and Aka so that Ag, starts with a, has 
no other symbol in Lz, and ends with a symbol from Ly,, while Aka has these 
properties in reverse order. To arrange this recoding, first choose Aj, starting 
with a, no other symbol is a, and ending with a symbol from Ly,. Choose Aj, 
to have these properties in reverse order. Introduce entirely new distinct symbols 
bak, Oka £ Lr for all a and k. Form a new shift of finite type from Xr by replacing 
all symbols in Aj, except the first and last by the same number of bax’s, and 
similarly with the A}, and bka’s. This recoding of Xr has the property sought, 
but is not 1-step. Pass to a higher order block presentation so the resulting shift is 
1-step, and the existence of the escape blocks as above is preserved. 

By the ergodic and Shannon-McMillan-Breiman theorems, there is an Mo so 
that for n > Mo and all k there is a collection C,(v%,) C Bn(Y,) so that 


Dk (Cn(ie)*) > 1-6, ICn(VK)| > exp[n(h(Y,) — 4)], and 


(10-8) |fa(C) —(a*)| <6 forall CEC,(%) anda €E Lr 


Let Mı = M(6, 4,...,4«) from Lemma 10.5, and let l be a transition length for 
or and each o7ly,. Now fix M > max{4l/6, Mo, Mı}. For n > M, each c,d E Lz, 
and C € Cn—4i (Ux) there is a block AckC' Akpa E Bn(Xr) with C a subblock of C”. 
Form the collection Dng(c, d) of such blocks. Note that each D € D,x(c,d) starts 
with c, ends with d, has no other symbols in Lz, and |fa(D) — %(a*)| < 26 by 
(10-8). Also, 


(10-9) |Dnk(c, d)| = |Cn—a1(M%)| > exp[(n — 41)(h(%) — 4)]. 
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We now construct p. Recall that C € B, (Xr) is called j-periodic if C[0,n— J] = 
C[j, n]. Since u is continuous, there is a p > M so that for all k, 


Lk (Uier: C € B2p+1(Xr) is j-periodic for some j < m}) < ô. 


By [Kr1] or [B1, Lemma 2.2], there is a compact open set F C Z such that 
{ol F: 0 < j < M} is disjoint, and 
2M—-1 
{x € Z: z[—p, p) is not j-periodic for all 7 < M} C U o4 F. 
j=0 
Recall that rr is the return time function for F, and put Fn = {x E F: rr(z) =n}. 
By the above we have 
2M—1 
(10-10) y nuk(Fn)>1-— ô. 
n=M 
Since F is compact open, there is a q > p so that F is a union of sets o,7B* for 
Be Bog+1(Z). Let 


F = {A € Bogsamai(Z): op 9) A C FY. 
Since M > M(6, 44,..., uk) from Lemma 10.5, choose disjoint collections Cn (uk) C 
B,(Z) for M<n< 2M and all 1 < k < K so that |Cn(ux)| < exp[n(ho(ux) + ô) 


and 
2M—1 


3 nur (Fn N Cn(Hk) ) > 1-26. 
n=M 
Thus for 6 small enough |Cyn(ux)| < |Dnk(c, d)| for all c,d € Lz. It follows there is 
a injection 
On: > VU Dnk(c, d) 
c,dELzZz 
that fixes the first and last symbols. 

Now define €: B(Xr) — B(Xr) to replace any occurrence of 0n (C) by C, where 
M <n <2M —1 and C € Cn(uk). This map is well-defined because blocks from 
Dnk(c, d) can only overlap in the end symbols, and these symbols are fixed by ôn. 
Let 

E = {E[2M,2q¢+6M + 1]: E € €&€! (B2m(Z)FB2m(Z))} 

and put E = gm ee We claim that {ol E: 0 < j < M} is disjoint. For if 

= o7y for z,y € E and some 0 < j < M, applying £ to z[-q — 2M,q + 2M] and 
y[—q — 2M — j,q + 2M — j] shows this would contradict disjointness of {0}. F : 0 < 
j < M}. The mapping ¢ is now defined as follows. If o42 € E and grami EE 
for M < n < 2M, then 

(pr)li, i +n- 1] = i a : n— 1), y ee sake gare 
0n (zlii +n -— 1)), if zfi,i +n -— 1] € On(Ca(ue)). 

Because {04 : 0 < j < M} is disjoint, p is well-defined. The definition of £ is made 
so that z[-q — 2M,q + 2M] € € iff (px)[-q — 2M,q+2M] € £. Hence p? = I. 
Finally, the estimate that |pu(a*) — v(a*)| < ĉo for all a € Lr follows from (10-8) 
and (10-10) exactly as in the previous lemma. O 
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LEMMA 10.8. Let u,v E€ P(Xr) with H, < Hy, let E C [0,1] have positive 
measure, and set uo = fe ut dt, pı = u — uo. Then there is a decomposition 
v = W% +v, with H,, > Hp, forj =0, 1. 


PROOF. Let a be the measure of E, and put 
uo = sup{u < 1 — a: H,(t+u) — H, (u) > Hpo (t) fr 0 <t <a}. 


The defining set clearly contains 0, so is nonempty. Put vo = fore vt dt and 
vı = v — vo. By definition, for 0 < t <a, 


H(t) = H(t + u) — Hv (u) > Hyo(t): 


We now show H,, > H,,. If uo = 1 — a, then since A(y*) is decreasing in t, for 
0<t<1-—a we have 


(10-11) H,, (t) = H, (t) > Hult) > Hy, (t). 


Now suppose up < 1 — a. Then by continuity 


uota 
(10-12) J Wears f h(ut) dt. 

uo E 
If t < uo, then (10-11) shows A, (t) > H„ (t). If uo < t < 1 — a, then using 
(10-12) we obtain 


uo+a 


H, (t) = H,(t + a) — J h(v*) ds 


uo 
> Hult +a)- | h(u°)ds > H, (t) D 
E 


PROOF OF THEOREM 10.1 (SUFFICIENCY). Suppose p is continuous with 
H, < Hy, and that V is a neighborhood of v in P(Xr). There are € > 0 and 
a neighborhood Vo of v in M(Xr) so that if v € Vo and v” € P(Xr), then 
(1 — e)v' + ev") /[(1 — e)u' (Xr) +e] E V. 

Since every proper subshift is contained in a proper shift of finite type, and there 
are only countably many of the latter, there is a proper subshift Z C Xr such that 


m(Eı) = m({t € [0,1]: supp(u’) ¢ Z,supp(u") #4 Xr}) < €, 


where m is Lebesgue measure. Let Eo = {t € [0,1]: supp(u") C Z} and Ep = {t € 
[0, 1] : supp(u*) = Xr}. These sets are clearly measurable. If m(Eo) = 0, then by 
Lemma 10.6 all but € of u can be mapped to a measure in Vo, and we are done. 

Thus suppose m(Eo) =a > 0. Let u; = f E, ut dt. Two applications of Lemma 
10.8 show that there is a decomposition v = vo + vı + v2 with Hy, 2 Ay, for 
0<j< 2. By Lemma 10.7, there is a pı E€ G so that po is close to vo. By 
Lemma 10.6, there is a p2 E€ G that is the identity on pı(Z) and that maps pe 
close to v2. It follows that the p; can be chosen so that p21 Maps Ho + u2 into 
Vo, and since u2(Xr) < £, we obtain finally (p291)uEV. D 


11. Problems and questions. During the course of this paper we have in- 
dicated several open problems and questions. Two of these seem to us the most 
important. Recall that Fn is the set of points in Xr with period < n. 
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QUESTION 6.10. Is the kernel of the dimension representation generated by 
elements of finite order? 


QUESTION 7.1. When is an automorphism in aut(Fn,or) the restriction of 
one in aut(Xr, or)? 


Several others seem to be particularly interesting. Problem 3.3 asks whether 
there is an automorphism of infinite order with nth roots for infinitely many n. 
This is basic to understanding the kind of divisibility present in G. Our lack of 
knowledge about the algebraic structure of G is pointed out in Question 4.1, which 
asks for a nontrivial case of two shifts of finite type having isomorphic automorphism 
groups. In particular, are the automorphism groups of the 2-shift and the 3-shift 
isomorphic? Problem 6.1 asks whether the dimension group representation 6 of G is 
always surjective. The p-adic behavior of the gyration function is quite interesting. 
Specifically (Question 8.5), does g(y,o7r)(p") always converge p-adically for every 
p E€ G? Problem 9.6 generalizes Question 7.1 above to characterizing the orbits of 
subshifts of Xr. 
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